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PROTEINS REGULATING GENE EXPRESSION 



TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of proteins regulating 
gene expression and to the use of these sequences in the diagnosis, treatment, and prevention of 
reproductive disorders, nervous disorders, and diseases associated with cell proliferation and 
differentiation, including cancer, immune disorders, and developmental disorders. 



! 0 BACKGROUND OF THE INVENTION 

Multicellular organisms are comprised of diverse cell types that differ dramatically both in 
structure and function. The identity of a cell is determined by its characteristic pattern of gene 
expression, and different cell types express overlapping but distinctive sets of genes throughout 
development. Spatial and temporal regulation of gene expression is critical for the control of cell 
15 proliferation, cell differentiation, apoptosis, senescence, and other cellular processes that 

contribute to organismal development. Furthermore, gene expression is regulated in response to 
extracellular signals that mediate cell-cell communication and coordinate the activities of different 
cell types. Appropriate gene regulation also ensures that cells function efficiently by expressing 
only those genes whose functions are required at a given time. 
20 Transcription Factors 

Transcriptional regulatory proteins are essential for the control of gene expression. Some 
of these proteins function as transcription factors that initiate, activate, repress, or terminate gene 
transcription. Transcription factors generally bind to the promoter, enhancer, and upstream 
regulatory regions of a gene in a sequence-specific manner, although some factors bind regulatory 
25 elements within or downstream of the internal coding region of a gene. Transcription factors may 
bind to a specific region of DNA singly or as a complex with other accessory factors. (Reviewed 
in Lewin, B. (1990) Genes IV , Oxford University Press, New York, NY, pp. 554-570.) 

The double helix structure and repeated sequences of DNA create topological and 
chemical features which can be recognized by transcription factors. These features are hydrogen 
30 bond donor and acceptor groups, hydrophobic patches, major and minor grooves, and regular, 
repeated stretches of sequence which induce distinct bends in the helix. Typically, transcription 
factors recognize specific DNA sequence motifs about 20 nucleotides in length or less. Multiple, 
adjacent transcription factor-binding motifs may be required for gene regulation. 

Many transcription factors incorporate DNA-binding structural motifs which comprise 
35 either a helices or B sheets that bind to the major groove of DNA. Four well-characterized 
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structural motifs are helix- turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins 
containing these motifs may act alone as monomers, or they may form homo- or heterodimers that 
interact with DNA. 

The helix-turn-helix motif consists of two a helices connected at a fixed angle by a short 
5 chain of amino acids. One of the helices binds to the major groove. Helix-turn-helix motifs are 
exemplified by the homeobox motif which is present in homeodomain proteins. These proteins 
are critical for specifying the anterior-posterior body axis during development and are conserved 
throughout the animal kingdom. The Antennapedia and Uitrabithorax proteins o f Drosophila 
melanogaster are prototypical homeodomain proteins (Pabo, CO. and R.T. Sauer (1992) Ann. 

10 Rev. Biochem. 61:1053-1095). 

The zinc finger motif, which binds zinc ions, generally contains tandem repeats of about 
30 amino acids consisting of periodically spaced cysteine and histidine residues. Examples of this 
sequence pattern, designated C2H2 and C3HC4, have been described (Lewin, supra.). Zinc finger 
proteins each contain an a helix and an antiparallel 6 sheet whose proximity and conformation are 

15 maintained by the zinc ion. Contact with DNA is made by the arginine preceding the a helix and 
by the second, third, and sixth residues of the a helix. Variants of the zinc finger motif include 
poorly defined cysteine-rich motifs which bind zinc or other metal ions. These motifs may not 
contain histidine residues and are generally nonrepetitive. 

The bromodomain signature is an additional conserved region of about 70 amino acids 

20 found in a number of transcriptional regulatory proteins (ExPASy PROSITE document PS00633; 
Haynes, S.R. et al. (1992) Nucleic Acids Res. 20:2603). Although the exact function of this 
domain is unclear, it is found in the DNA-binding region of the thyroid hormone receptor 
coactivating protein. The thyroid hormone receptor is a member of the steroid/thyroid receptor 
superfamily that regulates the expression of many target genes through binding to thyroid 

25 hormone response elements (Tsuyoshi, M. et al (1997) J. Biol. Chem. 272:29834-29841). The 
bromodomain signature is also found in eukaroytic transcriptional initiation factor, TFIID, a 
protein essential for progression of the Gl phase of the cell cycle. 

The leucine zipper motif comprises a stretch of amino acids rich in leucine which can 
form an amphipathic a helix. This structure provides the basis for dimerization of two leucine 

30 zipper proteins. The region adjacent to the leucine zipper is usually basic, and upon protein 
dimerization, is optimally positioned for binding to the major groove. 

The helix-loop-helix motif consists of a short a helix connected by a loop to a longer a 
helix. The loop is flexible and allows the two helices to fold back against each other and to bind to 
DNA. The transcription factor Myc contains a prototypical HLH motif. 
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Most transcription factors contain characteristic DNA binding motifs including, but not 
limited to, those described above. Variations on the above motifs and new motifs have been and 
are currently being characterized (Faisst, S. and S. Meyer (1992) Nucl. Acids Res. 20:3-26). 
Chromatin Associated Proteins 
5 In the nucleus, DNA is packaged into chromatin, the compact organization of which limits 

the accessibility of DNA to transcription factors and plays a key role in gene regulation (Lewin, 
supra , pp. 409-410). The compact structure of chromatin is determined and influenced by 
chromatin-associated proteins such as the histones, the high mobility group (HMG) proteins, and 
the chromodornain proteins. There are five classes of histones, HI, H2A, H2B, H3, and H4, all of 

10 which are highly basic, low molecular weight proteins. The fundamental unit of chromatin, the 
nucleosome, consists of 200 base pairs of DNA associated with two copies each of H2A, H2B, 
H3, and H4. HI links adjacent nucleosomes. HMG proteins are low molecular weight, 
non-histone proteins that may play a role in unwinding DNA and stabilizing single-stranded DNA. 
Chromodornain proteins play a key role in the formation of highly compacted heterochromatin, 

15 which is transcriptionally silent. 
RNA-Associated Proteins 

Much of the regulation of gene expression in eukaroytic cells occurs at the 
posttranscriptional level. Messenger RNAs (mRNA) which are produced in the cell nucleus from 
primary transcripts of protein -encoding genes are processed and transported to the cytoplasm 

20 where the protein synthesis machinery is located. RNA-associated proteins are a group of proteins 
that participate in the processing, splicing, editing, transport, localization, translation, stability, and 
posttranscriptional regulation of mRNAs. Such proteins include RNA helicases, splicing factors, 
nucleases, and translational regulatory proteins. In addition, the nucleolus is a highly organized 
subcompartment of the nucleus which contains protein machinery specifically dedicated to the 

25 transcription and processing of ribosomal RNAs. The RNA-binding activity of these proteins is 
mediated by a series of RNA-binding motifs identified within them. These domains include the 
RNP motif, the arginine-rich motif, the RGG box, and the KH motif. (Reviewed in Burd, C. G. 
and Dreyfuss, G. (1994) Science 265:615 - 621 .) The RNP motif is the most widely found 
and best characterized of these motifs. It is composed of 90-100 amino acids which form an 

30 RNA-binding domain, and is found in one or more copies in proteins that bind pre-mRNA, 

mRNA, pre-ribosomal RNA, and small nuclear RNAs. The RNP motif is composed of two short 
sequences (RNP-1 and RNP-2) and a number of other mostly hydrophobic, conserved amino acids 
interspersed throughout the motif (Burd, supra; ExPASy PROSITE document PD0C0030). 
Diseases and disorders related to gene regulation 
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Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 
Malignant cell growth may result from either excessive expression of tumor promoting genes or 
insufficient expression of tumor suppressor genes (Cleary, M.L. (1992) Cancer Surv. 15:89-104). 
Chromosomal translocations may also produce chimeric loci which fuse the coding sequence of 

5 one gene with the regulatory regions of a second unrelated gene. Such an arrangement likely 
results in inappropriate gene transcription. The Wilms tumor suppressor gene product, WT1, is a 
protein containing a DNA-binding domain consisting of four zinc fingers and a proline-glutamine 
rich region capable of regulating transcription (ExPASy PROSITE document PR00049). 
Deletions of the WT1 gene, or point mutations which destroy the DNA-binding activity of the 

10 protein, are assiociated with development of the pediatric nephroblastoma, Wilms tumor, and 
Denys-Drash syndrome (Rauscher, F.J. (1993) FASEB J. 7:896-903). 

Certain proteins enriched in glutamine are associated with various neurological disorders 
including spinocerebellar ataxia, bipolar effective disorder, schizophrenia and autism (Margolis, 
R.L. et ai. (1997) Human Genetics 100:1 14-122). These proteins contain regions with as many as 

15 15 or more consecutive glutamine residues and may function as transcription factors with a 
potential role in regulation of neurodevelopment or neuroplasticity. 

The immune system responds to infection or trauma by activating a cascade of events that 
coordinate the progressive selection, amplification, and mobilization of cellular defense 
mechanisms. A complex and balanced program of gene activation and repression is involved in 

20 this process. However, hyperactivity of the immune system as a result of improper or insufficient 
regulation of gene expression may result in considerable tissue or organ damage. This damage is 
well documented in immunological responses associated with arthritis, allergens, heart attack, 
stroke, and infections (Harrison's Principles of Internal Medicine , 13/e, McGraw Hill, Inc. and 
Teton Data Systems Software, 1996). In particular, a zinc finger protein termed Staf50 (for 

25 Stimulated trans-acting factor of 50 kDa) is a transcriptional regulator and is induced in various 
cell lines by interferon-I and -II. Staf50 appears to mediate the antiviral activity of interferon by 
down-regulating the viral transcription directed by the long terminal repeat promoter region of 
human immunodeficiency virus type-1 in transfected cells (Tissot, C. (1995) J. Biol. Chem. 
270:14891-14898). 

30 Furthermore, the generation of multicellular organisms is based upon the induction and 

coordination of cell differentiation at the appropriate stages of development. Central to this 
process is differential gene expression, which confers the distinct identities of cells and tissues 
throughout the body. Failure to regulate gene expression during development could result in 
developmental disorders. 

35 The discovery of new proteins regulating gene expression and the polynucleotides 
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encoding them satisfies a need in the art by providing new compositions which are useful in the 
diagnosis, prevention, and treatment of reproductive disorders, nervous disorders, and diseases 
associated with cell proliferation and differentiation, including cancer, immune disorders, and 
developmental disorders. 

5 

SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, proteins regulating gene 
expression, referred to collectively as "PRGE" and individually as "PRGE-l", "PRGE-2", 

10 "PRGE-3", "PRGE-4", "PRGE-5", "PRGE-6", "PRGE-7", "PRGE-8*', "PRGE-9", "PRGE-10", 
"PRGE- 11", "PRGE-12", "PRGE-13", "PRGE-14", "PRGE-15", "PRGE-16", "PRGE- 17", 
"PRGE-18", "PRGE-19", "PRGE-20", "PRGE-21", "PRGE-22", "PRGE-23", "PRGE-24", 
"PRGE-25", "PRGE-26", "PRGE-27", "PRGE-28", "PRGE-29", "PRGE- 3 0 ",and "PRGE-31". In 
one aspect, the invention provides a substantially purified polypeptide comprising an amino acid 

15 sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO: 1 0, SEQ ID NO: 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 13, SEQ ID NO: 1 4, SEQ ID NO: 1 5, SEQ 
ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:2 1 , 
SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID 

20 NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:3 1 (SEQ ID NO: 1-31), and 
fragments thereof 

The invention further provides a substantially purified variant having at least 90% amino 
acid identity to at least one of the amino acid sequences selected from the group consisting of SEQ 
ID NO: 1-3 1, and fragments thereof The invention also provides an isolated and purified 

25 polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-3 1, and fragments thereof. The invention also includes an 
isolated and purified polynucleotide variant having at least 90% polynucleotide sequence identity 
to the polynucleotide encoding the polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-3 1, and fragments thereof. 

30 Additionally, the invention provides an isolated and purified polynucleotide which 

hybridizes under stringent conditions to the polynucleotide encoding the polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-3 1 , and fragments 
thereof. The invention also provides an isolated and purified polynucleotide having a sequence 
which is complementary to the polynucleotide encoding the polypeptide comprising the amino 

35 acid sequence selected from the group consisting of SEQ ID NO: 1-3 1 , and fragments thereof 
The invention also provides an isolated and purified polynucleotide comprising a 
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polynucleotide sequence selected from the group consisting of SEQ ID NO:32, SEQ ID NO:33, 
SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID 
NO:39, SEQ ID NO:40, SEQ ID NO:41 , SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ 
ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, 
5 SEQ ID NO:5 1 , SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID 
NO:56, SEQ IDNO:57, SEQ IDNO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ 
ID NO:62 (SEQ ID NO:32-62), and fragments thereof. The invention further provides an isolated 
and purified polynucleotide variant having at least 90% polynucleotide sequence identity to the 
polynucleotide sequence selected from the group consisting of SEQ ID NO:32-62, and fragments 
10 thereof The invention also provides an isolated and purified polynucleotide having a sequence 
which is complementary to the polynucleotide comprising a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:32-62, and fragments thereof. 

The invention also provides a method for detecting a polynucleotide in a sample 
containing nucleic acids, the method comprising the steps of (a) hybridizing the complement of the 
15 polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a 
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of a polynucleotide in the sample. In one 
aspect, the method further comprises amplifying the polynucleotide prior to hybridization. 

The invention further provides an expression vector containing at least a fragment of the 
20 polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-3 1, and fragments thereof In another aspect, the expression 
vector is contained within a host cell. 

The invention also provides a method for producing a polypeptide, the method comprising 
the steps of: (a) culturing the host cell containing an expression vector containing at least a 
25 fragment of a polynucleotide under conditions suitable for the expression of the polypeptide; and 
(b) recovering the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composition comprising a substantially 
purified polypeptide having the amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-31, and fragments thereof, in conjunction with a suitable pharmaceutical carrier. 
30 The invention further includes a purified antibody which binds to a polypeptide selected 

from the group consisting of SEQ ID NO: 1-31, and fragments thereof The invention also 
provides a purified agonist and a purified antagonist to the polypeptide. 

The invention also provides a method for treating or preventing a disorder associated with 
decreased expression or activity of PRGE, the method comprising administering to a subject in 
35 need of such treatment an effective amount of a pharmaceutical composition comprising a 
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substantially purified polypeptide having the amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-31, and fragments thereof, in conjunction with a suitable 
pharmaceutical carrier. 

The invention also provides a method for treating or preventing a disorder associated with 
5 increased expression or activity of PRGE, the method comprising administering to a subject in 
need of such treatment an effective amount of an antagonist of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-31, and fragments thereof. 

BRIEF DESCRIPTION OF THE TABLES 

10 Table 1 shows nucleotide and polypeptide sequence identification numbers (SEQ ID NO), 

clone identification numbers (clone ID), cDNA libraries, and cDNA fragments used to assemble 

full-length sequences encoding PRGE. 

Table 2 shows features of each polypeptide sequence including potential motifs, 

homologous sequences, and methods and algorithms used for identification of PRGE. 
15 Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence as 

determined by northern analysis, diseases, disorders, or conditions associated with these tissues, 

and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which Incyte 

cDNA clones encoding PRGE were isolated. 
20 Table 5 shows the programs, their descriptions, references, and threshold parameters used 

to analyze PRGE. 



DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 
25 understood that this invention is not limited to the particular machines, materials and methods 

described, as these may vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention which will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," 
30 "an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 
antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled 
in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
35 meanings as commonly understood by one of ordinary skill in the art to which this invention 
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belongs. Although any machines, materials, and methods similar or equivalent to those described 
herein can be used to practice or test the present invention, the preferred machines, materials and 
methods are now described. All publications mentioned herein are cited for the purpose of 
describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
5 publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of 
prior invention. 
DEFINITIONS 

"PRGE" refers to the amino acid sequences of substantially purified PRGE obtained from 
10 any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, 
and preferably the human species, from any source, whether natural, synthetic, semi-synthetic, or 
recombinant. 

The term "agonist" refers to a molecule which, when bound to PRGE, increases or 
prolongs the duration of the effect of PRGE. Agonists may include proteins, nucleic acids, 

15 carbohydrates, or any other molecules which bind to and modulate the effect of PRGE. 

An "allelic variant" is an alternative form of the gene encoding PRGE. Allelic variants 
may result from at least one mutation in the nucleic acid sequence and may result in altered 
mRNAs or in polypeptides whose structure or function may or may not be altered. Any given 
natural or recombinant gene may have none, one, or many allelic forms. Common mutational 

20 changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or 
substitutions of nucleotides. Each of these types of changes may occur alone, or in combination 
with the others, one or more times in a given sequence. 

"Altered" nucleic acid sequences encoding PRGE include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polynucleotide the same as 

25 PRGE or a polypeptide with at least one functional characteristic of PRGE. Included within this 
definition are polymorphisms which may or may not be readily detectable using a particular 
oligonucleotide probe of the polynucleotide encoding PRGE, and improper or unexpected 
hybridization to allelic variants, with a locus other than the normal chromosomal locus for the 
polynucleotide sequence encoding PRGE. The encoded protein may also be "altered," and may 

30 contain deletions, insertions, or substitutions of amino acid residues which produce a silent change 
and result in a functionally equivalent PRGE. Deliberate amino acid substitutions may be made on 
the basis of similarity in polarity, charge, solubility, hydroph obi city, hydrophilicity, and/or the 
amphipathic nature of the residues, as long as the biological or immunological activity of PRGE is 
retained. For example, negatively charged amino acids may include aspartic acid and glutamic 

35 acid, positively charged amino acids may include lysine and arginine, and amino acids with 
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uncharged polar head groups having similar hydrophilicity values may include leucine, isoleucine, 
and valine; glycine and alanine; asparagine and glutamine; serine and threonine; and 
phenylalanine and tyrosine. 

The terms "amino acid" or "amino acid sequence" refer to an oligopeptide, peptide, 
5 polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. In this context, "fragments," "immunogenic fragments," or "antigenic 
fragments" refer to fragments of PRGE which are preferably at least 5 to about 15 amino acids in 
length, most preferably at least 14 amino acids, and which retain some biological activity or 
immunological activity of PRGE. Where "amino acid sequence" is recited to refer to an amino 
10 acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are 
not meant to limit the amino acid sequence to the complete native amino acid sequence associated 
with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
15 known in the art. 

The term "antagonist" refers to a molecule which, when bound to PRGE, decreases the 
amount or the duration of the effect of the biological or immunological activity of PRGE. 
Antagonists may include proteins, nucleic acids, carbohydrates, antibodies, or any other molecules 
which decrease the effect of PRGE. 
20 The term "antibody" refers to intact molecules as well as to fragments thereof, such as 

Fab, F(ab') 2 , and Fv fragments, which are capable of binding the epitopic determinant. Antibodies 
that bind PRGE polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide 
used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of 
25 RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly 
used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, 
and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that fragment of a molecule (i.e., an epitope) 
that makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
30 immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (given regions or three-dimensional structures on 
the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition containing a nucleic acid sequence which 
35 is complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules 

_ 9 _ 
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may be produced by any method including synthesis or transcription. Once introduced into a ceil, 
the complementary nucleotides combine with natural sequences produced by the cell to form 
duplexes and to block either transcription or translation. The designation "negative" can refer to 
the ant i sense strand, and the designation "positive" can refer to the sense strand. 
5 The term "biologically active," refers to a protein having structural, regulatory, or 

biochemical functions of a naturally occurring molecule. Likewise, "immunologically active" 
refers to the capability of the natural, recombinant, or synthetic PRGE, or of any oligopeptide 
thereof, to induce a specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

10 The terms "complementary" or "complementarity" refer to the natural binding of 

polynucleotides by base pairing. For example, the sequence "5* A-G-T 3 f " bonds to the 
complementary sequence "3* T-C-A 5'." Complementarity between two single-stranded molecules 
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that 
total complementarity exists between the single stranded molecules. The degree of 

15 complementarity between nucleic acid strands has significant effects on the efficiency and strength 
of the hybridization between the nucleic acid strands. This is of particular importance in 
amplification reactions, which depend upon binding between nucleic acids strands, and in the 
design and use of peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide sequence" or a "composition 

20 comprising a given amino acid sequence" refer broadly to any composition containing the given 
polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an 
aqueous solution. Compositions comprising polynucleotide sequences encoding PRGE or 
fragments of PRGE may be employed as hybridization probes. The probes may be stored in 
freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In 

25 hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCl), 
detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution, 
dry milk, salmon sperm DNA, etc.). 

"Consensus sequence'Yefers to a nucleic acid sequence which has been resequenced to 
resolve uncalled bases, extended using XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or 

30 the y direction, and resequenced, or which has been assembled from the overlapping sequences of 
more than one Incyte Clone using a computer program for fragment assembly, such as the 
GEL VIEW Fragment Assembly system (GCG, Madison WI). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

The term "correlates with expression of a polynucleotide" indicates that the detection of 

35 the presence of nucleic acids, the same or related to a nucleic acid sequence encoding PRGE, by 
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northern analysis is indicative of the presence of nucleic acids encoding PRGE in a sample, and 
thereby correlates with expression of the transcript from the polynucleotide encoding PRGE. 

A "de!etion"refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 
5 The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 

polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
example, replacement of hydrogen by an alkyl, acyl, or amino group. A derivative polynucleotide 
encodes a polypeptide which retains at least one biological or immunological function of the 
natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any 

10 similar process that retains at least one biological or immunological function of the polypeptide 
from which it was derived. 

The term "similarity" refers to a degree of complementarity. There may be partial 
similarity or complete similarity. The word "identity" may substitute for the word "similarity. " A 
partially complementary sequence that at least partially inhibits an identical sequence from 

15 hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of 

hybridization of the completely complementary sequence to the target sequence may be examined 
using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under 
conditions of reduced stringency. A substantially similar sequence or hybridization probe will 
compete for and inhibit the binding of a completely similar (identical) sequence to the target 

20 sequence under conditions of reduced stringency. This is not to say that conditions of reduced 
stringency are such that non-specific binding is permitted, as reduced stringency conditions 
require that the binding of two sequences to one another be a specific (i.e., a selective) interaction. 
The absence of non-specific binding may be tested by the use of a second target sequence which 
lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity). 

25 In the absence of non-specific binding, the substantially similar sequence or probe will not 
hybridize to the second non-complementary target sequence. 

The phrases "percent identity" or "% identity" refer to the percentage of sequence 
similarity found in a comparison of two or more amino acid or nucleic acid sequences. Percent 
identity can be determined electronically, e.g., by using the MEGALIGN program (DNASTAR, 

30 Madison WI) which creates alignments between two or more sequences according to methods 
selected by the user, e.g., the clustal method. (See, e.g., Higgins, D.G. and P.M. Sharp (1988) 
Gene 73 :237-244.) The clustal algorithm groups sequences into clusters by examining the 
distances between all pairs. The clusters are aligned pairwise and then in groups. The percentage 
similarity between two amino acid sequences, e.g., sequence A and sequence B, is calculated by 

35 dividing the length of sequence A, minus the number of gap residues in sequence A, minus the 
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number of gap residues in sequence B, into the sum of the residue matches between sequence A 
and sequence B, times one hundred. Gaps of low or of no similarity between the two amino acid 
sequences are not included in determining percentage similarity. Percent identity between nucleic 
acid sequences can also be counted or calculated by other methods known in the art, e.g., the Jotun 
5 Hein method. (See, e.g., Hein, J. (1990) Methods Enzymol. 183:626-645.) Identity between 
sequences can also be determined by other methods known in the art, e.g., by varying 
hybridization conditions. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may 
contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements 
10 required for stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to any process by which a strand of nucleic acid binds with a 
15 complementary strand through base pairing. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or R^t analysis) or formed between one 
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a 
20 solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 
substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" or "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively, 
to the sequence found in the naturally occurring molecule. 
25 "Immune response" can refer to conditions associated with inflammation, trauma, immune 

disorders, or infectious or genetic disease, etc. These conditions can be characterized by 
expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which 
may affect cellular and systemic defense systems. 

The term "rnicroarray" refers to an arrangement of distinct polynucleotides on a substrate. 
30 The terms "element" or "array element" in a rnicroarray context, refer to hybridizable 

polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of PRGE. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of PRGE. 
35 The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refer to a 
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nucleotide, oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to 
DNA or RNA of genomic or synthetic origin which may be single-stranded or double-stranded 
and may represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any 
DNA-like or RNA-like material. In this context, "fragments" refers to those nucleic acid 
5 sequences which, comprise a region of unique polynucleotide sequence that specifically identifies 
SEQ ID NO:32-62, for example, as distinct from any other sequence in the same genome. For 
example, a fragment of SEQ ID NO:32-62 is useful in hybridization and amplification 
technologies and in analogous methods that distinguish SEQ ID NO:32-62 from related 
polynucleotide sequences. A fragment of SEQ ID NO:32-62 is at least about 1 5-20 nucleotides in 

10 length. The precise length of the fragment of SEQ ID NO:32-62 and the region of SEQ ID 

NO:32-62 to which the fragment corresponds are routinely determinable by one of ordinary skill 
in the art based on the intended purpose for the fragment. Alternatively, a fragment when 
translated, would produce polypeptides retaining some functional characteristic, e.g., antigenicity, 
or structural domain characteristic, e.g., ATP-binding site, of the full-length polypeptide. 

15 The terms "operably associated" or "operably linked" refer to functionally related nucleic 

acid sequences. A promoter is operably associated or operably linked with a coding sequence if 
the promoter controls the translation of the encoded polypeptide. While operably associated or 
operably linked nucleic acid sequences can be contiguous and in the same reading frame, certain 
genetic elements, e.g., repressor genes, are not contiguously linked to the sequence encoding the 

20 polypeptide but still bind to operator sequences that control expression of the polypeptide. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 6 
nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and most preferably about 20 
to 25 nucleotides, which can be used in PCR amplification or in a hybridization assay or 
microarray. "Oligonucleotide" is substantially equivalent to the terms "amplimer," "primer/' 

25 "oligomer," and "probe," as these terms are commonly defined in the art. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 

30 elongation, and may be pegylated to extend their lifespan in the cell. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding PRGE, or fragments thereof, or PRGE itself, may comprise a bodily fluid; an 
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic 
DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

35 The terms "specific binding" or "specifically binding" refer to that interaction between a 
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protein or peptide and an agonist, an antibody, or an antagonist, 
the presence 
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The interaction is dependent upon 



5 of a particular structure of the protein, e.g., the antigenic determinant or epitope, recognized by the 
binding molecule. For example, if an antibody is specific for epitope "A," the presence of a 
polypeptide containing the epitope A, or the presence of free unlabeled A, in a reaction containing 
free labeled A and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "stringent conditions" refers to conditions which permit hybridization between 

10 polynucleotides and the claimed polynucleotides. Stringent conditions can be defined by salt 
concentration, the concentration of organic solvent, e.g., formamide, temperature, and other 
conditions well known in the art. In particular, stringency can be increased by reducing the 
concentration of salt, increasing the concentration of formamide, or raising the hybridization 
temperature. 

15 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably about 75% free, and most preferably about 90% free from other components with 
which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by 

20 different amino acids or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

25 "Transformation" describes a process by which exogenous DNA enters and changes a 

recipient cell. Transformation may occur under natural or artificial conditions according to 
various methods well known in the art, and may rely on any known method for the insertion of 
foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for 
transformation is selected based on the type of host cell being transformed and may include, but is 

30 not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment. 
The term "transformed" cells includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for 
limited periods of time. 

35 A "variant" of PRGE polypeptides refers to an amino acid sequence that is altered by one 
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or more amino acid residues. The variant may have "conservative" changes, wherein a substituted 
amino acid has similar structural or chemical properties (e.g., replacement of leucine with 
isoleucine). More rarely, a variant may have "nonconservative" changes (e.g., replacement of 
glycine with tryptophan). Analogous minor variations may also include amino acid deletions or 
5 insertions, or both. Guidance in determining which amino acid residues may be substituted, 
inserted, or deleted without abolishing biological or immunological activity may be found using 
computer programs well known in the art, for example, LASERGENE software (DNASTAR). 

The term "variant," when used in the context of a polynucleotide sequence, may 
encompass a polynucleotide sequence related to PRGE. This definition may also include, for 

10 example, "allelic" (as defined above), "splice," "species," or "polymorphic" variants. A splice 
variant may have significant identity to a reference molecule, but will generally have a greater or 
lesser number of polynucleotides due to alternate splicing of exons during mRNA processing. The 
corresponding polypeptide may possess additional functional domains or an absence of domains. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 

15 polypeptides generally will have significant amino acid identity relative to each other. A 

polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence 
of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for 

20 a disease state. 

THE INVENTION 

The invention is based on the discovery of new human proteins regulating gene expression 
(PRGE), the polynucleotides encoding PRGE, and the use of these compositions for the diagnosis, 
treatment, or prevention of reproductive disorders, nervous disorders, and diseases associated with 
25 cell proliferation and differentiation, including cancer, immune disorders, and developmental 
disorders. 

Table 1 lists the Incyte Clones used to derive full length nucleotide sequences encoding 
PRGE. Columns 1 and 2 show the sequence identification numbers (SEQ ID NO) of the amino 
acid and nucleic acid sequences, respectively. Column 3 shows the Clone ID of the Incyte Clone 

30 in which nucleic acids encoding each PRGE were identified, and column 4, the cDNA libraries 
from which these clones were isolated. Column 5 shows Incyte clones, their corresponding cDNA 
libraries, and shotgun sequences. The clones and shotgun sequences are part of the consensus 
nucleotide sequence of each PRGE. The regions of the full-length nucleotide sequence of each 
PRGE to which the clones and shotgun sequences correspond are listed in Column 5. The clones 

35 and fragments are useful as fragments in hybridization technologies. 
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The columns of Table 2 show various properties of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3, potential phosphorylation sites; column 4 ? potential glycosylation sites; 
column 5, the amino acid residues comprising signature sequences and motifs; column 6, the 
5 identity of each protein; and column 7, analytical methods used to identify each protein through 
sequence homology and protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding PRGE. The first column of Table 3 lists the 
nucleotide sequence identifiers. The second column lists tissue categories which express PRGE as 

10 a fraction of total tissue categories expressing PRGE. The third column lists the diseases, 

disorders, or conditions associated with those tissues expressing PRGE. The fourth column lists 
the vectors used to subclone the cDNA library. 

The following fragments of the nucleotide sequences encoding PRGE are useful in 
hybridization or amplification technologies to identify SEQ ID NO:56-62 and to distinguish 

15 between SEQ ID NO:56-62 and related polynucleotide sequences. The useful fragments are the 
fragment of SEQ ID NO: 56 from about nucleotide 1675 to about nucleotide 1719; the fragment of 
SEQ ID NO:57 from about nucleotide 379 to about nucleotide 423; the fragment of SEQ ID 
NO:58 from about nucleotide 596 to about nucleotide 640; the fragment of SEQ ID NO:59 from 
about nucleotide 219 to about nucleotide 263; the fragment of SEQ ID NO:60 from about 

20 nucleotide 732 to about nucleotide 776; the fragment of SEQ ID NO:61 from about nucleotide 197 
to about nucleotide 244; and the fragment of SEQ ID NO: 62 from about nucleotide 2 1 7 to about 
nucleotide 261 . 

The invention also encompasses PRGE variants. A preferred PRGE variant is one which 
has at least about 80%, more preferably at least about 90%, and most preferably at least about 95% 

25 amino acid sequence identity to the PRGE amino acid sequence, and which contains at least one 
functional or structural characteristic of PRGE. 

The invention also encompasses polynucleotides which encode PRGE. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence 
selected from the group consisting of SEQ ID NO:32-62, which encodes PRGE. 

30 The invention also encompasses a variant of a polynucleotide sequence encoding PRGE. 

In particular, such a variant polynucleotide sequence will have at least about 80%, more preferably 
at least about 90%, and most preferably at least about 95% polynucleotide sequence identity to the 
polynucleotide sequence encoding PRGE. A particular aspect of the invention encompasses a 
variant of a polynucleotide sequence comprising a sequence selected from the group consisting of 

35 SEQ ID NO:32-62 which has at least about 80%, more preferably at least about 90%, and most 

-16- 



WO 99/64596 PCTAJS99/13281 - 

preferably at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected 
from the group consisting of SEQ ID NO:32-62. Any one of the polynucleotide variants described 
above can encode an amino acid sequence which contains at least one functional or structural 
characteristic of PRGE. 

5 It will be appreciated by those skilled in the art that as a result of the degeneracy of the 

genetic code, a multitude of polynucleotide sequences encoding PRGE, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 

10 combinations are made in accordance with the standard triplet genetic code as applied to the 

polynucleotide sequence of naturally occurring PRGE, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode PRGE and its variants are preferably 
capable of hybridizing to the nucleotide sequence of the naturally occurring PRGE under 

15 appropriately selected conditions of stringency, it may be advantageous to produce nucleotide 

sequences encoding PRGE or its derivatives possessing a substantially different codon usage, e.g., 
inclusion of non-natural ly occurring codons. Codons may be selected to increase the rate at which 
expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance with 
the frequency with which particular codons are utilized by the host. Other reasons for 

20 substantially altering the nucleotide sequence encoding PRGE and its derivatives without altering 
the encoded amino acid sequences include the production of RNA transcripts having more 
desirable properties, such as a greater half-life, than transcripts produced from the naturally 
occurring sequence. 

The invention also encompasses production of DNA sequences which encode PRGE and 
25 PRGE derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding PRGE or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
30 hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:32-62 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. 
and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-51 1 .) For example, stringent salt concentration will ordinarily be less than about 750 mM 
NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM trisodium 
35 citrate, and most preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low 
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stringency hybridization can be obtained in the absence of organic solvent, e.g., formamide, while 
high stringency hybridization can be obtained in the presence of at least about 35% formamide, 
and most preferably at least about 50% formamide. Stringent temperature conditions will 
ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C, and 
5 most preferably of at least about 42°C. Varying additional parameters, such as hybridization time, 
the concentration of detergent, e.g., sodium dodecyi sulfate (SDS), and the inclusion or exclusion 
of carrier DNA, are well known to those skilled in the art. Various levels of stringency are 
accomplished by combining these various conditions as needed. In a preferred embodiment, 
hybridization will occur at 30°C in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In a 
10 more preferred embodiment, hybridization will occur at 37°C in 500 mM NaCl, 50 mM trisodium 
citrate, 1% SDS, 35% formamide, and 100 ^g/ml denatured salmon sperm DNA (ssDNA). In a 
most preferred embodiment, hybridization will occur at 42°C in 250 mM NaCl, 25 mM trisodium 
citrate, 1% SDS, 50 % formamide, and 200 Mg/ml ssDNA. Useful variations on these conditions 
will be readily apparent to those skilled in the art. 
15 The washing steps which follow hybridization can also vary in stringency. Wash 

stringency conditions can be defined by salt concentration and by temperature. As above, wash 
stringency can be increased by decreasing salt concentration or by increasing temperature. For 
example, stringent salt concentration for the wash steps will preferably be less than about 30 mM 
NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1.5 mM 
20 trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include 

temperature of at least about 25°C, more preferably of at least about 42°C, and most preferably of 
at least about 68°C. In a preferred embodiment, wash steps will occur at 25°C in 30 mM NaCl, 3 
mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at 
42°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. In a most preferred embodiment, 
25 wash steps will occur at 68°C in 15 mM NaCl, 1 .5 mM trisodium citrate, and 0.1% SDS. 
Additional variations on these conditions will be readily apparent to those skilled in the art. 

Methods for DNA sequencing are well known in the art and may be used to practice any 
of the embodiments of the invention. The methods may employ such enzymes as the Klenow 
fragment of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase 
30 (Perkin-Elmer), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the 
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
preparation is automated with machines such as the Hamilton MICROLAB 2200 (Hamilton, Reno 
NV), Peltier Thermal Cycler 200 (PTC200; MJ Research, Watertown MA) and the ABI 
35 CATALYST 800 (Perkin-Elmer). Sequencing is then carried out using either ABI 373 or 377 
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DNA sequencing systems (Perkin-Elmer) or the MEGABACE 1000 DNA sequencing system 
(Molecular Dynamics, Sunnyvale CA). The resulting sequences are analyzed using a variety of 
algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols in 
Molecular Biology , John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular 
5 Biology and Biotechnology . Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding PRGE may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream 
sequences, such as promoters and regulatory elements. For example, one method which may be 
employed, re strict ion- site PCR, uses universal and nested primers to amplify unknown sequence 

10 from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 
2:31 8-322.) Another method, inverse PCR, uses primers that extend in divergent directions to 
amplify unknown sequence from a circularized template. The template is derived from restriction 
fragments comprising a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et 
al. ( 1 988) Nucleic Acids Res. 16:81 86.) A third method, capture PCR, involves PCR 

15 amplification of DNA fragments adjacent to known sequences in human and yeast artificial 

chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-1 19.) In 
this method, multiple restriction enzyme digestions and ligations may be used to insert an 
engineered double-stranded sequence into a region of unknown sequence before performing PCR. 
Other methods which may be used to retrieve unknown sequences are known in the art. (See, e.g., 

20 Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-306). Additionally, one may use PCR, 
nested primers, and PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic 
DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon 
junctions. For al! PCR-based methods, primers may be designed using commercially available 
software, such as OLIGO 4.06 Primer Analysis software (National Biosciences, Plymouth MN) or 

25 another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of 
about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5* regions of genes, are preferable for situations in which an oligo d(T) 

30 library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 
sequence into 5 1 non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to 
analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, 
capillary sequencing may employ flowable polymers for electrophoretic separation, four different 

35 nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for 
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detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal 
using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), 
and the entire process from loading of samples to computer analysis and electronic data display 
may be computer controlled. Capillary electrophoresis is especially preferable for sequencing 
5 small DNA fragments which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 
which encode PRGE may be cloned in recombinant DNA molecules that direct expression of 
PRGE, or fragments or functional equivalents thereof, in appropriate host cells. Due to the 
inherent degeneracy of the genetic code, other DNA sequences which encode substantially the 
10 same or a functionally equivalent amino acid sequence may be produced and used to express 
PRGE. 

The nucleotide sequences of the present invention can be engineered using methods 
generally known in the art in order to alter PRGE-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the 
15 gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments 
and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, 
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create 
new restriction sites, alter glycosylation patterns, change codon preference, produce splice 
variants, and so forth. 

20 In another embodiment, sequences encoding PRGE may be synthesized, in whole or in 

part, using chemical methods well known in the art. (See, e.g., Caruthers, M.fi. et al. (1980) Nucl. 
Acids Res. Symp. Sen 215-223, and Horn, T. et al. (1980) Nucl. Acids Res. Symp. Ser. 225-232.) 
Alternatively, PRGE itself or a fragment thereof may be synthesized using chemical methods. For 
example, peptide synthesis can be performed using various solid-phase techniques, (See, e.g., 

25 Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using 
the ABI 431 A Peptide Synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of 
PRGE, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
from other proteins, or any part thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 

30 chromatography. (See, e.g, Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392- 
423 .) The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g.. Creighton, T. (1984) Proteins, Structures and Molecular Properties . WH 
Freeman, New York NY.) 

In order to express a biologically active PRGE, the nucleotide sequences encoding PRGE 

35 or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
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contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3* untranslated regions in the vector and in 
polynucleotide sequences encoding PRGE. Such elements may vary in their strength and 
5 specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding PRGE. Such signals include the ATG initiation codon and adjacent 
sequences, e.g. the Kozak sequence. In cases where sequences encoding PRGE and its initiation 
codon and upstream regulatory sequences are inserted into the appropriate expression vector, no 
additional transcriptional or translational control signals may be needed. However, in cases where 

10 only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in- frame ATG initiation codon should be provided by the vector. Exogenous 
translational elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 
particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 

15 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding PRGE and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, 
synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) 

20 Molecular Cloning. A Laboratory Manual Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
16-17; Ausubel, F.M. et al. (T995^ Current Protocols in Molecular Biology , John Wiley & Sons, 
New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding PRGE. These include, but are not limited to, microorganisms such as bacteria 

25 transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected with viral expression 
vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., 
cauliflower mosaic virus, CaMV, or tobacco mosaic virus,TMV) or with bacterial expression 
vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. The invention is not limited by the 

30 host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected 
depending upon the use intended for polynucleotide sequences encoding PRGE. For example, 
routine cloning, subcloning, and propagation of polynucleotide sequences encoding PRGE can be 
achieved using a multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) 

35 or pSPORTl plasmid (Life Technologies). Ligation of sequences encoding PRGE into the 
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vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure 
for identification of transformed bacteria containing recombinant molecules. In addition, these 
vectors may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with 
helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. 

5 and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of PRGE are 
needed, e.g. for the production of antibodies, vectors which direct high level expression of PRGE 
may be used. For example, vectors containing the strong, inducible T5 or T7 bacteriophage 
promoter may be used. 

Yeast expression systems may be used for production of PRGE. A number of vectors 

10 containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, 
may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such vectors 
direct either the secretion or intracellular retention of expressed proteins and enable integration of 
foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra; 
Grant et al. (1987) Methods Enzymol. 153:516-54; and Scorer, C. A. et al. (1994) Bio/Technology 

15 12:181-184.) 

Plant systems may also be used for expression of PRGE. Transcription of sequences 
encoding PRGE may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N (1987) 
EMBO J. 6:307-3 1 1). Alternatively, plant promoters such as the small subunit of RUBISCO or 

20 heat shock promoters may be used. (See, e.g., Coruzzi, G. et ah (1984) EMBO J. 3:1671-1680; 
Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. ( 1 99 1 ) Results Probl. Cell 
Differ. 1 7:85-105.) These constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. (See, e.g.. The McGraw Hill Yearbook of 
Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196.) 

25 In mammalian cells, a number of viral-based expression systems may be utilized. In cases 

where an adenovirus is used as an expression vector, sequences encoding PRGE may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 
obtain infective virus which expresses PRGE in host cells. (See, e.g., Logan, J. and T. Shenk 

30 (1984) Proc. Natl. Acad. Sci. 81:3655-3659.) In addition, transcription enhancers, such as the 

Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 
SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments 
of DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb 

35 are constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
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polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, JJ. et al. (1997) Nat Genet. 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable 
expression of PRGE in cell lines is preferred. For example, sequences encoding PRGE can be 
5 transformed into cell lines using expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the same or on a separate 
vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 
days in enriched media before being switched to selective media. The purpose of the selectable 
marker is to confer resistance to a selective agent, and its presence allows growth and recovery of 

10 cells which successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk or apr cells, respectively. (See, e.g., Wigler, M. et 

15 al. (1977) Cell 1 1:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, 

antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr confers 
resistance to methotrexate; neo confers resistance to the aminoglycosides, neomycin and G-41 8; 
and ah or pat confer resistance to chlorsulfuron and phosphinotricin acetyl transferase, 
respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-3570; 

20 Colbere-Garapin, F. et al. (1981) J. Mol, Biol. 150:1-14.) Additional selectable genes have been 
described, e.g., trpB and hisD, which alter cellular requirements for metabolites. (See, e.g., 
Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-8051.) Visible markers, 
e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), B glucuronidase and its substrate 
B-glucuronide, or luciferase and its substrate luciferin may be used. These markers can be used 

25 not only to identify transformants, but also to quantify the amount of transient or stable protein 
expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. (1995) Methods MoL 
Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. For 

30 example, if the sequence encoding PRGE is inserted within a marker gene sequence, transformed 
cells containing sequences encoding PRGE can be identified by the absence of marker gene 
function. Alternatively, a marker gene can be placed in tandem with a sequence encoding PRGE 
under the control of a single promoter. Expression of the marker gene in response to induction or 
selection usually indicates expression of the tandem gene as well. 

35 In general, host cells that contain the nucleic acid sequence encoding PRGE and that 
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express PRGE may be identified by a variety of procedures known to those of skill in the art. 
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, 
solution, or chip based technologies for the detection and/or quantification of nucleic acid or 
5 protein sequences. 

Immunological methods for detecting and measuring the expression of PRGE using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELlSAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 

10 monoclonal antibodies reactive to two non- interfering epitopes on PRGE is preferred, but a 

competitive binding assay may be employed. These and other assays are well known in the art. 
(See, e.g., Hampton, R. et ah (1990) Serological Methods, a Laboratory Manual . APS Press, St 
Paul MN, Sect. IV; Coligan, J. E. et al. (1997) Current Protocols in Immunology , Greene Pub. 
Associates and Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical 

15 Protocols , Humana Press, Totowa NJ). 

A wide variety of labels and conjugation techniques are known by those skilled in the art 
and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding PRGE 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled 

20 nucleotide. Alternatively, the sequences encoding PRGE, or any fragments thereof, may be 

cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures 
may be conducted using a variety of commercially available kits, such as those provided by 

25 Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes, 
fluorescent, chemi luminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding PRGE may be cultured under 

30 conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the 
sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which encode PRGE may be designed to contain signal 
sequences which direct secretion of PRGE through a prokaryotic or eukaryotic cell membrane. 

35 In addition, a host cell strain may be chosen for its ability to modulate expression of the 
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inserted sequences or to process the expressed protein in the desired fashion. Such modifications 
of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-trans lational processing which cleaves a "prepro" 
form of the protein may also be used to specify protein targeting, folding, and/or activity. 
5 Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38), are available from 
the American Type Culture Collection (ATCC, Bethesda MD) and may be chosen to ensure the 
correct modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 

10 sequences encoding PRGE may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric PRGE protein 
containing a heterologous moiety that can be recognized by a commercially available antibody 
may facilitate the screening of peptide libraries for inhibitors of PRGE activity. Heterologous 
protein and peptide moieties may also facilitate purification of fusion proteins using commercially 

15 available affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase 
(GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6- 
His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification 
of their cognate fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, 
calmodulin, and metal-chelate resins, respectively. FLAG, c-myc^ and hemagglutinin (HA) enable 

20 immunoaffinity purification of fusion proteins using commercially available monoclonal and 

polyclonal antibodies that specifically recognize these epitope tags. A fusion protein may also be 
engineered to contain a proteolytic cleavage site located between the PRGE encoding sequence 
and the heterologous protein sequence, so that PRGE may be cleaved away from the heterologous 
moiety following purification. Methods for fusion protein expression and purification are 

25 discussed in Ausubel (1995, supra , ch 10). A variety of commercially available kits may also be 
used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled PRGE may be 
achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems 
(Prornega). These systems couple transcription and translation of protein-coding sequences 

30 operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of 
a radiolabeled amino acid precursor, preferably 35 S-methionine. 

Fragments of PRGE may be produced not only by recombinant production, but also by 
direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) 
Protein synthesis may be performed by manual techniques or by automation. Automated synthesis 

35 may be achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin-Elmer). Various 
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fragments of PRGE may be synthesized separately and then combined to produce the full length 
molecule. 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
5 between regions of PRGE and proteins regulating gene expression. In addition, the expression of 
PRGE is closely associated with cancer and other cell proliferative conditions, differentiated cells, 
inflammation and the immune response, and is found in reproductive and nervous system tissues. 
Therefore, PRGE appears to play a role in reproductive disorders, nervous disorders, and diseases 
associated with cell proliferation and differentiation, including cancer, immune disorders, and 

10 developmental disorders. In the treatment of the above conditions associated with increased 

PRGE expression or activity, it is desirable to decrease the expression or activity of PRGE. In the 
treatment of the above conditions associated with decreased PRGE expression or activity, it is 
desirable to increase the expression or activity of PRGE. 

Therefore, in one embodiment, PRGE or a fragment or derivative thereof may be 

15 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of PRGE. Examples of such a disorder include, but are not limited to, a reproductive 
disorder such as disorders of prolactin production; infertility, including tubal disease, ovulatory 
defects, and endometriosis; disruptions of the estrous cycle, disruptions of the menstrual cycle, 
polycystic ovary syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian tumors, 

20 uterine fibroids, autoimmune disorders, ectopic pregnancies, and teratogenesis; cancer of the 
breast, fibrocystic breast disease, and galactorrhea; disruptions of spermatogenesis, abnormal 
sperm physiology, cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, 
prostatitis, Peyronie's disease, impotence, carcinoma of the male breast, and gynecomastia; and a 
nervous disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, 

25 Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and 
other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, 
progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis 
and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, 
epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central 

30 nervous system disease; prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome; fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigerninal syndrome, mental retardation and other developmental disorders of the 
central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system 

35 disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
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neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis; 
inherited, metabolic, endocrine, and toxic myopathies; myasthenia gravis, periodic paralysis; 
mental disorders including mood, anxiety, and schizophrenic disorders; akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
5 neuralgia, and Tourette's disorder; a cell proliferative disorder such as actinic keratosis, 

arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 
primary thrombocythemia; cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, 

10 bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, 
liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
testis, thymus, thyroid, and uterus; an immune disorder such as acquired immunodeficiency 
syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing 
spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, 

15 autoimmune thyroiditis, bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic 
dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with 
lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 

20 pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Re iter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and helminthic infections, trauma, X-linked 

25 agammaglobinemia of Bruton, common variable immunodeficiency (CVI), DiGeorge's syndrome 
(thymic hypoplasia), thymic dysplasia, isolated IgA deficiency, severe combined 
immunodeficiency disease (SOD), immunodeficiency with thrombocytopenia and eczema 
(Wiskott-Aldrich syndrome), Chediak-Higashi syndrome, chronic granulomatous diseases, 
hereditary angioneurotic edema, and immunodeficiency associated with Cushing's disease; and a 

30 developmental disorder such as renal tubular acidosis, anemia, Cushing's syndrome, 

achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal 
dysgenesis, WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental 
retardation), Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial 
dysplasia, hereditary keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease 

35 and neurofibromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's 
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chorea and cerebral palsy, spina bifida, anencephaly, craniorachischisis, congenital glaucoma, 
cataract, sensorineural hearing loss, and any disorder associated with cell growth and 
differentiation, embryogenesis, and morphogenesis involving any tissue, organ, or system of a 
subject, e.g., the brain, adrenal gland, kidney, skeletal or reproductive system. 
5 In another embodiment, a vector capable of expressing PRGE or a fragment or derivative 

thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of PRGE including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially 
purified PRGE in conjunction with a suitable pharmaceutical carrier may be administered to a 

10 subject to treat or prevent a disorder associated with decreased expression or activity of PRGE 
including, but not limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of PRGE may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of PRGE including, but not limited to, those listed above. 

15 In a further embodiment, an antagonist of PRGE may be administered to a subject to treat 

or prevent a disorder associated with increased expression or activity of PRGE. Examples of such 
disorders include, but are not limited to, those described above. In one aspect, an antibody which 
specifically binds PRGE may be used directly as an antagonist or indirectly as a targeting or 
delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express PRGE. 

20 In an additional embodiment, a vector expressing the complement of the polynucleotide 

encoding PRGE may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of PRGE including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, 
complementary sequences, or vectors of the invention may be administered in combination with 

25 other appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical 
principles. The combination of therapeutic agents may act synergistically to effect the treatment 
or prevention of the various disorders described above. Using this approach, one may be able to 
achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 

30 adverse side effects. 

An antagonist of PRGE may be produced using methods which are generally known in the 
art. In particular, purified PRGE may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind PRGE. Antibodies to PRGE may 
also be generated using methods that are well known in the art. Such antibodies may include, but 

35 are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Pab fragments, 
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and fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which 
inhibit dimer formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, 
humans, and others may be immunized by injection with PRGE or with any fragment or 
5 oligopeptide thereof which has immunogenic properties. Depending on the host species, various 
adjuvants may be used to increase immunological response. Such adjuvants include, but are not 
limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. 
Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum 
10 are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
PRGE have an amino acid sequence consisting of at least about 5 amino acids, and, more 
preferably, of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, 
or fragments are identical to a portion of the amino acid sequence of the natural protein and 
15 contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of 
PRGE amino acids may be fused with those of another protein, such as KLH, and antibodies to the 
chimeric molecule may be produced. 

Monoclonal antibodies to PRGE may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are 
20 not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV- 
hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. 
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. 
80:2026-2030; and Cole, S.P. et al. (1984) MoL Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
25 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) 
Proc. Natl. Acad. Sci. 81 :685 1-6855; Neuberger, M.S. et al. (1984) Nature 3 12:604-608; and 
Takeda, S. et al. (1985) Nature 3 14:452-454.) Alternatively, techniques described for the 
production of single chain antibodies may be adapted, using methods known in the art, to produce 
30 PRGE-specific single chain antibodies. Antibodies with related specificity, but of distinct 
idiotypic composition, may be generated by chain shuffling from random combinatorial 
immunoglobulin libraries. (See, e.g., Burton D.R. (1991) Proc. Natl. Acad. Sci. 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents 
35 as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. 86: 
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3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for PRGE may also be generated. 
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by 
pepsin digestion of the antibody molecule and Fab fragments generated by reducing the disulfide 
5 bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to 
allow rapid and easy identification of monoclonal Fab fragments with the desired specificity. 
(See, e.g.,Huse, W.D. et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the 
desired specificity. Numerous protocols for competitive binding or immunoradiometric assays 

10 using either polyclonal or monoclonal antibodies with established specificities are well known in 
the art. Such immunoassays typically involve the measurement of complex formation between 
PRGE and its specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering PRGE epitopes is preferred, but a competitive binding 
assay may also be employed (Pound, supra ). 

15 Various methods such as Scatchard analysis in conjunction with radioimmunoassay 

techniques may be used to assess the affinity of antibodies for PRGE. Affinity is expressed as an 
association constant, K a , which is defined as the molar concentration of PRGE-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium 
conditions. The K a determined for a preparation of polyclonal antibodies, which are 

20 heterogeneous in their affinities for multiple PRGE epitopes, represents the average affinity, or 
avidity, of the antibodies for PRGE. The K a determined for a preparation of monoclonal 
antibodies, which are monospecific for a particular PRGE epitope, represents a true measure of 
affinity. High-affinity antibody preparations with K a ranging from about 1 0 9 to 10 12 L/mole are 
preferred for use in immunoassays in which the PRGE-antibody complex must withstand rigorous 

25 manipulations. Low-affinity antibody preparations with ranging from about 10 6 to 10 7 L/mole 
are preferred for use in immunopurification and similar procedures which ultimately require 
dissociation of PRGE, preferably in active form, from the antibody (Catty, D. (1988) Antibodies, 
Volume I: A Practical Approach , IRL Press, Washington, DC; Liddell, J. E. and Cryer, A. (1991) 
A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York NY). 

30 The titer and avidity of polyclonal antibody preparations may be further evaluated to 

determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is preferred for use in procedures requiring precipitation 
of PRGE-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, 

35 and guidelines for antibody quality and usage in various applications, are generally available. 
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(See, e.g., Catty, supra , and Coligan et al. supra .) 

In another embodiment of the invention, the polynucleotides encoding PRGE, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the 
complement of the polynucleotide encoding PRGE may be used in situations in which it would be 
5 desirable to block the transcription of the mRNA. In particular, cells may be transformed with 
sequences complementary to polynucleotides encoding PRGE. Thus, complementary molecules 
or fragments may be used to modulate PRGE activity, or to achieve regulation of gene function. 
Such technology is now well known in the art, and sense or antisense oligonucleotides or larger 
fragments can be designed from various locations along the coding or control regions of sequences 
10 encoding PRGE. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, 
or from various bacterial piasmids, may be used for delivery of nucleotide sequences to the 
targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art 
can be used to construct vectors to express nucleic acid sequences complementary to the 

15 polynucleotides encoding PRGE. (See, e.g., Sam brook, supra ; Ausubel, 1995, supra.) 

Genes encoding PRGE can be turned off by transforming a cell or tissue with expression 
vectors which express high levels of a polynucleotide, or fragment thereof, encoding PRGE. Such 
constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even 
in the absence of integration into the DNA, such vectors may continue to transcribe RNA 

20 molecules until they are disabled by endogenous nucleases. Transient expression may last for a 
month or more with a non-replicating vector, and may last even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or 

25 regulatory regions of the gene encoding PRGE. Oligonucleotides derived from the transcription 
initiation site, e.g., between about positions -10 and +10 from the start site, are preferred. 
Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix 
pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently 
for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic 

30 advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. 

(1994) in Huber, B.E. and B.I. Carr, Molecular and Immunologic Approaches , Futura Publishing, 
Mt. Kisco NY, pp. 163-177.) A complementary sequence or antisense molecule may also be 
designed to block translation of mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 

35 of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the 
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ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For 
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding PRGE. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
5 scanning the target molecule for ribozyme cleavage sites, including the following sequences: 
GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may 
be evaluated for secondary structural features which may render the oligonucleotide inoperable. 
The suitability of candidate targets may also be evaluated by testing accessibility to hybridization 

10 with complementary oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro 

15 and in vivo transcription of DNA sequences encoding PRGE. Such DNA sequences may be 

incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or 
SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 

20 modifications include, but are not limited to, the addition of flanking sequences at the 5 ! and/or 3 f 
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as 
inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms 

25 of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by 
endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and equally 
suitable for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into 
stem cells taken from the patient and clonally propagated for autologous transplant back into that 

30 same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers 
may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. 
(1997) Nature Biotechnology 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, 

35 monkeys, and most preferably, humans. 
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An additional embodiment of the invention relates to the administration of a 
pharmaceutical or sterile composition, in conjunction with a pharmaceutical ly acceptable carrier, 
for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist 
of PRGE, antibodies to PRGE, and mimetics, agonists, antagonists, or inhibitors of PRGE. The 
5 compositions may be administered alone or in combination with at least one other agent, such as a 
stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical 
carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions 
may be administered to a patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any 
10 number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 

intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain 
suitable pharmaceutical ly-acceptable carriers comprising excipients and auxiliaries which 
15 facilitate processing of the active compounds into preparations which can be used 

pharmaceutical ly. Further details on techniques for formulation and administration may be found 
in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutical^ acceptable carriers well known in the art in dosages suitable for oral 
20 administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by 
the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with solid excipient and processing the resultant mixture of granules (optionally, after 

25 grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 
excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 
mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as 
methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethyl cellulose; gums, 
including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, 

30 disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 
sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or 

35 solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 
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product identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
5 starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty 
oils, liquid, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, 
10 Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be 
prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include 
fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or 
15 liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the 
suspension may also contain suitable stabilizers or agents to increase the solubility of the 
compounds and allow for the preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 
20 The pharmaceutical compositions of the present invention may be manufactured in a 

manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and 
25 succinic acid. Salts tend to be more soluble in aqueous or other protonic solvents than are the 
corresponding free base forms. In other cases, the preferred preparation may be a lyophilized 
powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1% to 2% 
sucrose, and 2% to 7% mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to 



use. 



30 After pharmaceutical compositions have been prepared, they can be placed in an 

appropriate container and labeled for treatment of an indicated condition. For administration of 
PRGE, such labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions 
wherein the active ingredients are contained in an effective amount to achieve the intended 

35 purpose. The determination of an effective dose is well within the capability of those skilled in the 
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art. 

For any compound, the therapeutically effective dose can be estimated initially either in 
cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, rabbits, dogs, or 
pigs. An animal model may also be used to determine the appropriate concentration range and 
5 route of administration. Such information can then be used to determine useful doses and routes 
for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
PRGE or fragments thereof, antibodies of PRGE, and agonists, antagonists or inhibitors of PRGE, 
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
10 determined by standard pharmaceutical procedures in cell cultures or with experimental animals, 
such as by calculating the ED 50 (the dose therapeutically effective in 50% of the population) or 
LD 50 (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic 
effects is the therapeutic index, and it can be expressed as the LD 50 /ED 50 ratio. Pharmaceutical 
compositions which exhibit large therapeutic indices are preferred. The data obtained from cell 
15 culture assays and animal studies are used to formulate a range of dosage for human use. The 
dosage contained in such compositions is preferably within a range of circulating concentrations 
that includes the ED 50 with little or no toxicity. The dosage varies within this range depending 
upon the dosage form employed, the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
20 subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of 
the active moiety or to maintain the desired effect. Factors which may be taken into account 
include the severity of the disease state, the general health of the subject, the age, weight, and 
gender of the subject, time and frequency of administration, drug combination(s), reaction 
sensitivities, and response to therapy. Long-acting pharmaceutical compositions may be 
25 administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance 
rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 ^g to 1 00,000 //g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
30 Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind PRGE may be used for the 
35 diagnosis of disorders characterized by expression of PRGE, or in assays to monitor patients being 
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treated with PRGE or agonists, antagonists, or inhibitors of PRGE. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for PRGE include methods which utilize the antibody and a label to detect 
PRGE in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 

5 without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known 
in the art and may be used. 

A variety of protocols for measuring PRGE, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of PRGE 

10 expression. Normal or standard values for PRGE expression are established by combining body 
fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibody to 
PRGE under conditions suitable for complex formation. The amount of standard complex 
formation may be quantitated by various methods, preferably by photometric means. Quantities of 
PRGE expressed in subject, control, and disease samples from biopsied tissues are compared with 

15 the standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding PRGE may be used 
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
sequences, complementary RNA and DMA molecules, and PNAs. The polynucleotides may be 

20 used to detect and quantitate gene expression in biopsied tissues in which expression of PRGE 
may be correlated with disease. The diagnostic assay may be used to determine absence, 
presence, and excess expression of PRGE, and to monitor regulation of PRGE levels during 
therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting 

25 polynucleotide sequences, including genomic sequences, encoding PRGE or closely related 

molecules may be used to identify nucleic acid sequences which encode PRGE. The specificity of 
the probe, whether it is made from a highly specific region, e.g., the 5* regulatory region, or from a 
less specific region, e.g., a conserved motif, and the stringency of the hybridization or 
amplification (maximal, high, intermediate, or low), will determine whether the probe identifies 

30 only naturally occurring sequences encoding PRGE, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and should preferably 
have at least 50% sequence identity to any of the PRGE encoding sequences. The hybridization 
probes of the subject invention may be DNA or RNA and may be derived from the sequence of 
SEQ ID NO:32-62 or from genomic sequences including promoters, enhancers, and introns of the 

35 PRGE gene. 
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Means for producing specific hybridization probes for DN As encoding PRGE include the 
cloning of polynucleotide sequences encoding PRGE or PRGE derivatives into vectors for the 
production ofmRNA probes. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
5 polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding PRGE may be used for the diagnosis of disorders 
associated with expression of PRGE. Examples of such a disorder include, but are not limited to, a 
10 reproductive disorder such as disorders of prolactin production; infertility, including tubal disease, 
ovulatory defects, and endometriosis; disruptions of the estrous cycle, disruptions of the menstrual 
cycle, polycystic ovary syndrome, ovarian hyperstimuiation syndrome, endometrial and ovarian 
tumors, uterine fibroids, autoimmune disorders, ectopic pregnancies, and teratogenesis; cancer of 
the breast, fibrocystic breast disease, and galactorrhea; disruptions of spermatogenesis, abnormal 
15 sperm physiology, cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, 

prostatitis, Peyronie's disease, impotence, carcinoma of the male breast, and gynecomastia; and a 
nervous disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, 
Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and 
other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, 
20 progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis 
and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, 
epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central 
nervous system disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome; fatal familial insomnia, nutritional and metabolic diseases of the 
25 nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the 
central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system 
disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis; 
30 inherited, metabolic, endocrine, and toxic myopathies; myasthenia gravis, periodic paralysis; 
mental disorders including mood, anxiety, and schizophrenic disorders; akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder; a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
35 (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, 

-37- 



WO 99/64596 PCT/US99/13281 

primary thrombocythemia; cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, 
bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, 
liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
5 testis, thymus, thyroid, and uterus; an immune disorder such as acquired immunodeficiency 
syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing 
spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, 
autoimmune thyroiditis, bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic 
dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with 
10 lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 

glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
15 lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and helminthic infections, trauma, X-linked 
agammaglobinemia of Bruton, common variable immunodeficiency (CVI), DiGeorge's syndrome 
(thymic hypoplasia), thymic dysplasia, isolated IgA deficiency, severe combined 
20 immunodeficiency disease (SCID), immunodeficiency with thrombocytopenia and eczema 
(Wiskott-Aldrich syndrome), Chediak-Higashi syndrome, chronic granulomatous diseases, 
hereditary angioneurotic edema, and immunodeficiency associated with Cushing's disease; and a 
developmental disorder such as renal tubular acidosis, anemia, Cushing's syndrome, 
achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal 
25 dysgenesis, WAGR syndrome (Wilms' tumor, aniridia, genitourinary abnormalities, and mental 
retardation), Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial 
dysplasia, hereditary keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease 
and neurofibromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's 
chorea and cerebral palsy, spina bifida, anencephaly, craniorachischisis, congenital glaucoma, 
30 cataract, sensorineural hearing loss, and any disorder associated with cell growth and 

differentiation, embryogenesis, and morphogenesis involving any tissue, organ, or system of a 
subject, e.g., the brain, adrenal gland, kidney, skeletal or reproductive system. 

The polynucleotide sequences encoding PRGE may be used in Southern or northern 
analysis, dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, pin, 
35 and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to 
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detect altered PRGE expression. Such qualitative or quantitative methods are well known in the 
art. 

In a particular aspect, the nucleotide sequences encoding PRGE may be useful in assays 
that detect the presence of associated disorders, particularly those mentioned above. The 
5 nucleotide sequences encoding PRGE may be labeled by standard methods and added to a fluid or 
tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes- After a suitable incubation period, the sample is washed and the signal is quantitated 
and compared with a standard value. If the amount of signal in the patient sample is significantly 
altered in comparison to a control sample then the presence of altered levels of nucleotide 
10 sequences encoding PRGE in the sample indicates the presence of the associated disorder. Such 
assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in 
animal studies, in clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
PRGE, a normal or standard profile for expression is established. This may be accomplished by 
15 combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding PRGE, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially 
purified polynucleotide is used. Standard values obtained in this manner may be compared with 
20 values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in 
the patient begins to approximate that which is observed in the normal subject. The results 
25 obtained from successive assays may be used to show the efficacy of treatment over a period 
ranging from several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the 
30 appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health 
professionals to employ preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
PRGE may involve the use of PCR. These oligomers may be chemically synthesized, generated 
35 enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a 
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polynucleotide encoding PRGE, or a fragment of a polynucleotide complementary to the 
polynucleotide encoding PRGE, and will be employed under optimized conditions for 
identification of a specific gene or condition. Oligomers may also be employed under less 
stringent conditions for detection or quantitation of closely related DNA or RNA sequences. 

5 Methods which may also be used to quantitate the expression of PRGE include 

radiolabeling or biotinylating nucleotides, coamplifl cation of a control nucleic acid, and 
interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. 
Methods 159:235-244; Duplaa, C. et ah (1993) Anal. Biochem. 229-236.) The speed of 
quantitation of multiple samples may be accelerated by running the assay in an ELISA format 

10 where the oligomer of interest is presented in various dilutions and a spectrophotometric or 
colorimetric response gives rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The 
microarray can be used to monitor the expression level of large numbers of genes simultaneously 

15 and to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and 
to develop and monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, 
e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. 

20 Acad. Sci. 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 116; Shalon, 
D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. 
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding PRGE may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic 

25 sequence. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial 
PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) 
Nat Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends 

30 Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-UIrich, et al. (1995) in 
Meyers, supra, pp. 965-968.) Examples of genetic map data can be found in various scientific 
journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between the 

35 location of the gene encoding PRGE on a physical chromosomal map and a specific disorder, or a 
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predisposition to a specific disorder, may help define the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
sequences among normal, carrier, and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such 
5 as linkage analysis using established chromosomal markers, may be used for extending genetic 
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
mouse, may reveal associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to chromosomal arms by physical 
mapping. This provides valuable information to investigators searching for disease genes using 

10 positional cloning or other gene discovery techniques. Once the disease or syndrome has been 
crudely localized by genetic linkage to a particular genomic region, e.g., ataxia-telangi ectasia to 
1 lq22-23, any sequences mapping to that area may represent associated or regulatory genes for 
further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide 
sequence of the subject invention may also be used to detect differences in the chromosomal 

15 location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, PRGE, its catalytic or immunogenic fragments, 
or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of 
drug screening techniques. The fragment employed in such screening may be free in solution, 
affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of 

20 binding complexes between PRGE and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. 
(1984) PCT application WO84/03564.) In this method, large numbers of different small test 
compounds are synthesized on a solid substrate. The test compounds are reacted with PRGE, or 

25 fragments thereof, and washed. Bound PRGE is then detected by methods well known in the art. 
Purified PRGE can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which 

30 neutralizing antibodies capable of binding PRGE specifically compete with a test compound for 
binding PRGE. In this manner, antibodies can be used to detect the presence of any peptide which 
shares one or more antigenic determinants with PRGE. 

In additional embodiments, the nucleotide sequences which encode PRGE may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely 

35 on properties of nucleotide sequences that are currently known, including, but not limited to, such 
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properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the 
preceding description, utilize the present invention to its fullest extent. The following preferred 
specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of 
5 the remainder of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, 
in particular U.S. Ser. No. 60/089,029, U.S. Ser. No. 60/094,575, and U.S. Ser. No. 60/104,624, 
are hereby expressly incorporated by reference. 

EXAMPLES 

10 I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were 
homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting 
15 lysates were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated 
from the lysates with either isopropanol or sodium acetate and ethanol, or by other routine 
methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was 

20 isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 
(QIAGEN, Valencia CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 

25 cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), 
using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 
1997, supra , units 5.1-6.6). Reverse transcription was initiated using oligo d(T) or random 
primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA 

30 was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA 
was size-selected (300-1000 bp) using SEPHACRYL SI 000, SEPHAROSE CL2B, or 
SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative 
agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the 
polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), pSPORTl plasmid 

35 (Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasm ids 
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were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR from 
Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. II. 
Isolation of cDNA Clones 

Plasmids were recovered from host ceils by in vivo excision, using the UNIZAP vector 
5 system (Stratagene) or cell lysis. Plasmids were purified using at least one of the following: a 
Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep 
purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 
Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the REAL Prep 96 plasmid kit 
from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of distilled water 
10 and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in 
a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and 
thermal cycling steps were carried out in a single reaction mixture. Samples were processed and 
stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified 
15 fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a Fluoroskan II 
fluorescence scanner (Labsystems Oy, Helsinki, Finland). 
IIL Sequencing and Analysis 

The cDNAs were prepared for sequencing using the ABI CATALYST 800 (Perkin-Elmer) 
or the HYDRA microdispenser (Robbins Scientific) or MICROLAB 2200 (Hamilton) systems in 
20 combination with the PTC-200 thermal cyclers (MJ Research). The cDNAs were sequenced using 
the ABI PRISM 373 or 377 sequencing systems (Perkin-Elmer) and standard ABI protocols, base 
calling software, and kits. In one alternative, cDNAs were sequenced using the MEGABACE 
1000 DNA sequencing system (Molecular Dynamics). In another alternative, the cDNAs were 
amplified and sequenced using the ABI PRISM BIGDYE Terminator cycle sequencing ready 
25 reaction kit (Perkin-Elmer). In yet another alternative, cDNAs were sequenced using solutions 
and dyes from Amersham Pharmacia Biotech. Reading frames for the ESTs were determined 
using standard methods (reviewed in Ausubel, 1997, supra, unit 7.7). Some of the cDNA 
sequences were selected for extension using the techniques disclosed in Example V. 

The polynucleotide sequences derived from cDNA, extension, and shotgun sequencing 
30 were assembled and analyzed using a combination of software programs which utilize algorithms 
well known to those skilled in the art. Table 5 summarizes the software programs, descriptions, 
references, and threshold parameters used. The first column of Table 5 shows the tools, programs, 
and algorithms used, the second column provides a brief description thereof, the third column 
presents the references which are incorporated by reference herein, and the fourth column 
35 presents, where applicable, the scores, probability values, and other parameters used to evaluate 
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the strength of a match between two sequences (the higher the probability the greater the 
homology). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
5 sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then 
queried against a selection of public databases such as GenBank primate, rodent, mammalian, 
vertebrate, and eukaryote databases, and BLOCKS to acquire annotation, using programs based 
on BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide 
10 sequences using programs based on Phred, Phrap, and Consed, and were screened for open 
reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
polynucleotide sequences were translated to derive the corresponding full length amino acid 
sequences, and these full length sequences were subsequently analyzed by querying against 
databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
15 Prosite, and Hidden Markov Model (HMM)-based protein family databases such as PFAM. 
HMM is a probalistic approach which analyzes consensus primary structures of gene families. 
(See, e.g., Eddy, S.R. (1996) Cur. Opin. Str. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide 
and amino acid sequences were also used to identify polynucleotide sequence fragments from 
20 SEQ ID NO:32-62. Fragments from about 20 to about 4000 nucleotides which are useful in 
hybridization and amplification technologies were described in The Invention section above. 
IV. Northern Analysis 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
25 RNAs from a particular cell type or tissue have been bound. (See, e.g., Sam brook, .supra, ch. 7; 
Ausubel, 1995, supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or 
related molecules in nucleotide databases such as GenBank or LIFESEQ database (Incyte 
Pharmaceuticals). This analysis is much faster than multiple membrane-based hybridizations. In 
30 addition, the sensitivity of the computer search can be modified to determine whether any 

particular match is categorized as exact or similar. The basis of the search is the product score, 
which is defined as: 

% sequence identity x % maximum BLAST score 
100 

35 The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. For example, with a product score of 40, the match will be exact 
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within a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar 
molecules are usually identified by selecting those which show product scores between 15 and 40, 
although lower scores may identify related molecules. 

The results of northern analyses are reported as a percentage distribution of libraries in 
5 which the transcript encoding PRGE occurred. Analysis involved the categorization of cDNA 
libraries by organ/tissue and disease. The organ/tissue categories included cardiovascular, 
dermatologic, developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, 
nervous, reproductive, and urologic. The disease/condition categories included cancer, 
inflammation/trauma, cell proliferation, neurological, and pooled. For each category, the number 
10 of libraries expressing the sequence of interest was counted and divided by the total number of 
libraries across all categories. Percentage values of tissue-specific and disease- or condition- 
specific expression are reported in Table 3. 
V. Extension of PRGE Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:56-62 were produced by extension 
15 of an appropriate fragment of the full length molecule using oligonucleotide primers designed 
from this fragment. One primer was synthesized to initiate 5' extension of the known fragment, 
and the other primer, to initiate 3* extension of the known fragment. The initial primers were 
designed using OLIGO 4.06 software (National Biosciences), or another appropriate program, to 
be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal 
20 to the target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides 
which would result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. 
25 PCR was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The 
reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2+ , 
(NH 4 ) 2 S0 4 , and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), 
ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the 
following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; 
30 Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 

68°C, 5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ 
were as follows: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68 Q C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 
The concentration of DNA in each well was determined by dispensing 100 \i\ 
35 PICOGREEN quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) 
dissolved in IX TE and 0.5 pi of undiluted PCR product into each well of an opaque fluorimeter 
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plate (Corning Costar, Acton MA), allowing the DNA to bind to the reagent. The plate was 
scanned in a Fluoroskan 11 (Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the 
sample and to quantify the concentration of DNA. A 5 fA to 10 ju) aliquot of the reaction mixture 
was analyzed by electrophoresis on a 1 % agarose mini-ge! to determine which reactions were 
5 successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 1 8 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
10 agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 
(Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in 
restriction site overhangs, and transfected into competent j^_coH cells. Transformed cells were 
selected on antibiotic-containing media, individual colonies were picked and cultured overnight at 
15 37°C in 384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1 : 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
20 quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low 
DNA recoveries were reamplified using the same conditions as described above. Samples were 
diluted with 20% dimethysulphoxide (1 :2, v/v), and sequenced using D YEN AMI C energy transfer 
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer). 
25 In like manner, the nucleotide sequences of SEQ ID NO:56-62 are used to obtain 5' 

regulatory sequences using the procedure above, oligonucleotides designed for such extension, 
and an appropriate genomic library. 

The nucleic acid sequences of SEQ ID NO:32-55 were used to design oligonucleotide 
primers for extending a partial nucleotide sequence to full length. For each nucleic acid sequence, 
30 one primer was synthesized to initiate extension of an antisense polynucleotide, and the other was 
synthesized to initiate extension of a sense polynucleotide. Primers were used to facilitate the 
extension of the known sequence "outward" generating amplicons containing new unknown 
nucleotide sequence for the region of interest. The initial primers were designed from the cDNA 
using OLIGO™ 4.06 (National Biosciences, Plymouth, MN), or another appropriate program, to 
35 be about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal 
to the target sequence at temperatures of about 68 °C to about 72°C. Any stretch of nucleotides 
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which would result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries (GlBCO BRL) were used to extend the sequence. If more 
than one extension is necessary or desired, additional sets of primers are designed to further 
extend the known region. 
5 High fidelity amplification was obtained by following the instructions for the XL-PCR™ 

kit (Perkin Elmer) and thoroughly mixing the enzyme and reaction mix. PCR was performed 
using the Peltier Thermal Cycler (PTC200; M.J. Research, Watertown, MA), beginning with 40 
pmol of each primer and the recommended concentrations of all other components of the kit, with 
the following parameters: 



10 Step 1 94° C for 1 min (initial denaturation) 

Step 2 65° C fori min 

Step 3 68° C for 6 min 

Step 4 94° C for 15 sec 

Step 5 65° C fori min 

15 Step 6 68° C for 7 min 

Step 7 Repeat steps 4 through 6 for an additional 1 5 cycles 

Step 8 94° C for 15 sec 

Step 9 65 ° C for 1 min 

Step 10 68° C for 7:15 min 

20 Step 1 1 Repeat steps 8 through 1 0 for an additional 12 cycles 

Step 12 72° C for 8 min 

Step 13 4° C (and holding) 



A 5 Adl to 10 fi\ aliquot of the reaction mixture was analyzed by electrophoresis on a low 

25 concentration (about 0.6% to 0.8%) agarose mini-gel to determine which reactions were successful 
in extending the sequence. Bands thought to contain the largest products were excised from the 
gel, purified using QIAQUICK™ (QIAGEN Inc.), and trimmed of overhangs using Klenow 
enzyme to facilitate religation and cloning. 

After ethanol precipitation, the products were redissolved in 13 fA of ligation buffer, 1/J 

30 T4-DNA ligase (15 units) and ) /u\ T4 polynucleotide kinase were added, and the mixture was 

incubated at room temperature for 2 to 3 hours, or overnight at 16° C. Competent E. coli cells (in 
40 yul of appropriate media) were transformed with 3 fA of ligation mixture and cultured in 80 /^I 
of SOC medium. (See, e.g., Sambrook, supra . Appendix A, p. 2.) After incubation for one hour at 
37°C, the E. coli mixture was plated on Luria Bertani (LB) agar (See, e.g., Sambrook, supra, 

35 Appendix A, p. 1) containing carbeniciliin (2x carb). The following day, several colonies were 
randomly picked from each plate and cultured in 150 /u\ of liquid LB/2x carb medium placed in an 
individual well of an appropriate commercially-available sterile 96-well microtiter plate. The 
following day, 5 /u\ of each overnight culture was transferred into a non-sterile 96-well plate and, 
after dilution 1:10 with water, 5 fu.\ from each sample was transferred into a PCR array. 

40 For PCR amplification, 1 8 /u\ of concentrated PCR reaction mix (3.3x) containing 4 units 
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of rTth DNA polymerase, a vector primer, and one or both of the gene specific primers used for 
the extension reaction were added to each well. Amplification was performed using the following 
conditions: 

Step 1 94° C for 60 sec 

5 Step 2 94° C for 20 sec 

Step 3 55° C for 30 sec 

Step 4 72° C for 90 sec 

Step 5 Repeat steps 2 through 4 for an additional 29 cycles 

Step 6 72° C for 180 sec 

10 Step 7 4° C (and holding) 

Aliquots of the PCR reactions were run on agarose gels together with molecular weight 
markers. The sizes of the PCR products were compared to the original partial cDNAs, and 
appropriate clones were selected, ligated into plasmid, and sequenced. 
15 In like manner, the nucleotide sequences of SEQ ID NO:32-55 are used to obtain 5' 

regulatory sequences using the procedure above, oligonucleotides designed for 5' extension, and 
an appropriate genomic library. 

VI. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ IDNO:32-62 are employed to screen cDNAs, 

20 genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 
base pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 fuCi of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase t 

25 (DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane- 
based hybridization analysis of human genomic DNA digested with one of the following 
endonucleases: Ase I, Bgl II, Eco Rl, Pst I, Xbal , or Pvu II (DuPont NEN). 

30 The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% sodium dodecyl 
sulfate. After XOMAT-AR film (Eastman Kodak, Rochester NY) is exposed to the blots to film 

35 for several hours, hybridization patterns are compared visually. 

VII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize array 
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elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a 
dot or slot blot may also be used to arrange and link elements to the surface of a substrate using 
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by 
hand or using available methods and machines and contain any appropriate number of elements. 
5 After hybridization, nonhybridized probes are removed and a scanner used to determine the levels 
and patterns of fluorescence. The degree of complementarity and the relative abundance of each 
probe which hybridizes to an element on the microarray may be assessed through analysis of the 
scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may 
10 comprise the elements of the microarray. Fragments suitable for hybridization can be selected 
using software well known in the art such as LASERGENE software (DNASTAR). Full-length 
cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the 
present invention, or selected at random from a cDNA library relevant to the present invention, are 
arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., 
15 UV cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) 
Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The 
substrate is analyzed by procedures described above. 
VTTL Complementary Polynucleotides 
20 Sequences complementary to the PRGE-encoding sequences, or any parts thereof, are 

used to detect, decrease, or inhibit expression of naturally occurring PRGE. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides 
are designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of 
25 PRGE. To inhibit transcription, a complementary oligonucleotide is designed from the most 
unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit 
translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the 
PRGE-encoding transcript. 
IX. Expression of PRGE 
30 Expression and purification of PRGE is achieved using bacterial or virus-based 

expression systems. For expression of PRGE in bacteria, cDNA is subcloned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels 
of cDNA transcription. Examples of such promoters include, but are not limited to, the trp-lac 
(tac) hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac 
35 operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, 
e.g., BL21(DE3). Antibiotic resistant bacteria express PRGE upon induction with isopropyl beta- 
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D-thiogalactopyranoside (1PTG). Expression of PRGE in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis 
virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of 
baculovirus is replaced with cDNA encoding PRGE by either homologous recombination or 
5 bacterial-mediated transposition involving transfer plasmid intermediates. Viral infectivity is 
maintained and the strong polyhedrin promoter drives high levels of cDNA transcription. 
Recombinant baculovirus is used to infect Spodoptera frugiperda (Sf9) insect cells in most cases, 
or human hepatocytes, in some cases. Infection of the latter requires additional genetic 
modifications to baculovirus. (See Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. USA 
10 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) 

In most expression systems, PRGE is synthesized as a fusion protein with, e.g., 
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, 
single-step, affinity-based purification of recombinant fusion protein from crude ceil lysates. 
GST, a 26-kilodalton enzyme from Schistosoma iaponicurru enables the purification of fusion 
15 proteins on immobilized glutathione under conditions that maintain protein activity and 

antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be 
proteolytically cleaved from PRGE at specifically engineered sites. FLAG, an 8-amino acid 
peptide, enables immunoaffinity purification using commercially available monoclonal and 
polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine 
20 residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression 
and purification are discussed in Ausubel (1995. supra , ch 10 and 16). Purified PRGE obtained by 
these methods can be used directly in the following activity assay. 
X* Demonstration of PRGE Activity 

PRGE activity is measured by its ability to stimulate transcription of a reporter gene (Liu, 
25 H.Y. et al. (1997) EMBO J. 16(17):5289-5298.) The assay entails the use of a well characterized 
reporter gene construct, LexA op -LacZ, that consists of LexA DNA transcriptional control 
elements (LexA op ) fused to sequences encoding the E. coli LacZ enzyme. The methods for 
constructing and expressing fusion genes, introducing them into cells, and measuring LacZ 
enzyme activity, are well known to those skilled in the art. Sequences encoding PRGE are cloned 
30 into a plasmid that directs the synthesis of a fusion protein, LexA-PRGE, consisting of PRGE and 
a DNA binding domain derived from the LexA transcription factor. The resulting plasmid, 
encoding a LexA-PRGE fusion protein, is introduced into yeast cells along with a plasmid 
containing the LexA op -LacZ reporter gene. The amount of LacZ enzyme activity associated with 
LexA-PRGE transfected cells, relative to control cells, is proportional to the amount of 
35 transcription stimulated by the PRGE. 
XI. Functional Assays 
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PRGE function is assessed by expressing the sequences encoding PRGE at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter that drives high levels of cDNA 
expression. Vectors of choice include pCMV SPORT (Life Technologies) and pCR3.1 
5 (Invitrogen, Carlsbad CA), both of which contain the cytomegalovirus promoter. 5- 1 0 fug of 
recombinant vector are transiently transfected into a human cell line, preferably of endothelial or 
hematopoietic origin, using either liposome formulations or electroporation. 1-2 ^g of an 
additional plasmid containing sequences encoding a marker protein are co-transfected. Expression 
of a marker protein provides a means to distinguish transfected cells from nontransfected cells and 
10 is a reliable predictor of cDNA expression from the recombinant vector. Marker proteins of 
choice include, e.g., Green Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion 
protein. Flow cytometry (FCM), an automated, laser optics-based technique, is used to identify 
transfected cells expressing GFP or CD64-GFP, and to evaluate properties, for example, their 
apoptotic state. FCM detects and quantifies the uptake of fluorescent molecules that diagnose 
15 events preceding or coincident with cell death. These events include changes in nuclear DNA 
content as measured by staining of DNA with propidium iodide; changes in cell size and 
granularity as measured by forward light scatter and 90 degree side light scatter; down-regulation 
of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
expression of cell surface and intracellular proteins as measured by reactivity with specific 
20 antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M. G. (1994) Flow Cytometry , Oxford, New York NY. 

The influence of PRGE on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding PRGE and either CD64 or CD64-GFP. 
25 CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved 
regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from 
nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64 
(DYNAL, Lake Success NY). mRNA can be purified from the cells using methods well known 
by those of skill in the art. Expression of mRNA encoding PRGE and other genes of interest can 
30 be analyzed by northern analysis or microarray techniques. 
XII. Production of PRGE Specific Antibodies 

PRGE substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods EnzymoL 182:488-495), or other purification techniques, is 
used to immunize rabbits and to produce antibodies using standard protocols. 
35 Alternatively, the PRGE amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
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synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are 
well described in the art. (See, e.g., Ausubel, 1995, supra, ch. 1 1.) 

Typically, oligopeptides 15 residues in length are synthesized using an ABI 431 A 
5 Peptide Synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, 
St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to 
increase immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the 
oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for 
antipeptide activity by, for example, binding the peptide to plastic, blocking with 1% BSA, 
10 reacting with rabbit antisera, washing, and reacting with radio- iodinated goat anti-rabbit IgG. 
XIII. Purification of Naturally Occurring PRGE Using Specific Antibodies 

Naturally occurring or recombinant PRGE is substantially purified by immunoaffinity 
chromatography using antibodies specific for PRGE. An immunoaffinity column is constructed 
by covalently coupling anti-PRGE antibody to an activated chromatographic resin, such as 
15 CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing PRGE are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of PRGE (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
20 antibody/PRGE binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, 
such as urea or thiocyanate ion), and PRGE is collected. 
XTV. Identification of Molecules Which Interact with PRGE 

PRGE, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter 
reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules previously 
25 arrayed in the wells of a multi-well plate are incubated with the labeled PRGE, washed, and any 
wells with labeled PRGE complex are assayed. Data obtained using different concentrations of 
PRGE are used to calculate values for the number, affinity, and association of PRGE with the 
candidate molecules. 

Various modifications and variations of the described methods and systems of the 
30 invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly limited 
to such specific embodiments. Indeed, various modifications of the described modes for carrying 
out the invention which are obvious to those skilled in molecular biology or related fields are 
35 intended to be within the scope of the following claims. 
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1 . A substantially purified polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ IDNO:l, SEQ ID NO:2, SEQ ID NO:3 ? SEQ ID NO:4, SEQ ID 
5 NO:5, SEQ IDNO:6, SEQ ID NO;7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID 
NO:ll, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ 
ID NO: 1 7, SEQ ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:2 1 , SEQ ID NO:22, 
SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO;25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:3 1 , and fragments thereof. 
10 2. A substantially purified variant having at least 90% amino acid sequence identity 

to the amino acid sequence of claim 1 . 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1 . 

4. An isolated and purified polynucleotide variant having at least 90% 
polynucleotide sequence identity to the polynucleotide of claim 3, 

15 5. An isolated and purified polynucleotide which hybridizes under stringent 

conditions to the polynucleotide of claim 3, 

6. An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 3. 

I . A method for detecting a polynucleotide, the method comprising the steps of: 

20 (a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in a 

sample, thereby forming a hybridization complex; and 

(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of the polynucleotide in the sample. 

8. The method of claim 7 further comprising amplifying the polynucleotide prior to 
25 hybridization. 

9. An isolated and purified polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID 
NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ 
ID NO:41 , SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, 

30 SEQ ID NO:47, SEQ ID NO:48, SEQIDNO:49, SEQIDNO.50, SEQIDNO:51, SEQ ID 

NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ 
ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, and fragments 
thereof 

10. An isolated and purified polynucleotide variant having at least 90% 
35 polynucleotide sequence identity to the polynucleotide of claim 9. 

I I . An isolated and purified polynucleotide having a sequence which is 
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complementary to the polynucleotide of claim 9. 

12. An expression vector comprising at least a fragment of the polynucleotide of 

claim 3. 

13. A host cell comprising the expression vector of claim 12. 

5 1 4. A method for producing a polypeptide, the method comprising the steps of: 

a) culturing the host cell of claim 13 under conditions suitable for the 
expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

15. A pharmaceutical composition comprising the polypeptide of claim 1 in 
10 conjunction with a suitable pharmaceutical carrier. 

16. A purified antibody which specifically binds to the polypeptide of claim 1 . 

1 7. A purified agonist of the polypeptide of claim 1 . 

18. A purified antagonist of the polypeptide of claim 1 . 

19. A method for treating or preventing a disorder associated with decreased 

15 expression or activity of PRGE, the method comprising administering to a subject in need of such 
treatment an effective amount of the pharmaceutical composition of claim 15. 

20. A method for treating or preventing a disorder associated with increased 
expression or activity of PRGE, the method comprising administering to a subject in need of such 
treatment an effective amount of the antagonist of claim 18. 
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Met Val Phe Phe Thr Cys Asn Ala Cys Gly Glu Ser Val Lys Lys 

15 10 15 

He Gin Val Glu Lys His Val Ser Val Cys Arg Asn Cys Glu Cys 

20 25 30 

Leu Ser Cys lie Asp Cys Gly Lys Asp Phe Trp Gly Asp Asp Tyr 

35 40 45 

Lys Asn His Val Lys Cys He Ser Glu Asp Gin Lys Tyr Gly Gly 

50 55 60 

Lys Gly Tyr Glu Gly Lys Thr His Lys Gly Asp lie Lys Gin Gin 

65 70 75 

Ala Trp lie Gin Lys He Ser Glu Leu He Lys Arg Pro Asn Val 

80 85 90 

Ser Pro Lys Val Arg Glu Leu Leu Glu Gin He Ser Ala Phe Asp 

95 100 105 

Asn Val Pro Arg Lys Lys Ala Lys Phe Gin Asn Trp Met Lys Asn 

110 115 120 

Ser Leu Lys Val His Asn Glu Ser He Leu Asp Gin Val Trp Asn 



1/66 



WO 99/64596 



PCT/US99/I3281 











125 










130 










135 


lie 


Phe 


Ser 


Glu 


Ala 
140 


Ser 


Asn 


Ser 


Glu 


Pro 
145 


Val 


Asn 


Lys 


Glu 


Gin 
150 


Asp 


Gin 


Arg 


Pro 


Leu 
155 


His 


Pro 


Val 


Ala 


Asn 
160 


Pro 


His 


Ala 


Glu 


lie 
165 


Ser 


Thr 


Lys 


Val 


Pro 
170 


Ala 


Ser 


Lys 


Val 
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Glu 


Glu 


Ala 


Arg 


Val 


Gly Ala 


Gly 


Lys 










260 










265 










270 


Arg 


Lys 


Arg 


Arg 


His 
275 


Ser 


Glu 


Val 


Glu 


Thr 
280 


Asp 


Ser 


Lys 


Lys 


285 


Lys 


Met 


Lys 


Leu 


Pro 
290 


Glu 


His 


Pro 


Glu 


Gly 
295 


Gly 


Glu 


Pro 


Glu 


300 


Asp 


Glu 


Ala 


Pro 


Ala 
305 


Lys 


Gly 


Lys 


Phe 


Asn 
310 


TriD 


Lys 


Gly 


Thr 


Tip 

lie 

J ± O 


Lys 


Ala 


He 


Leu 


Lys 
320 


Gin 


Ala 


Pro 


Asp 


Asn 
325 


Glu 


He 


Thr 


He 


Lys 

330 


Lys 


Leu 


Arg 


Lys 


Lys 
335 


Val 


Leu 


Ala 


Gin 


Tyr 
340 


Tyr 


Thr 


Val 


Thr 


Asp 
345 


Glu 


His 


His 


Arg 


Ser 
350 


Glu 


Glu 


Glu 


Leu 


Leu 
355 


Val 


He 


Phe 


Asn 


Lys 
360 


Lys 


He 


Ser 


Lys 


Asn 
365 


Pro 


Thr 


Phe 


Lys 


Leu 
370 


Leu 


Lys 


Asp 


Lys 


Val 
375 


Lys 


Leu 


Val 


Lys 

























<210> 2 
<211> 136 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Clone No. : 815856 

<400> 2 

Met Phe Gly Thr Pro Gin Glu His 

1 5 
Ala Met Val Leu Val Ala Arg Asn 
20 

Cys Arg Glu Lys Glu Arg Glu Glu 
35 

Met Ala Asp Glu Ala He Leu Gin 
50 

Glu Glu Gly Val Arg Gly Gly His 



Arg 


Asn 


Met 


Pro 


Gin 


Ala 


Asp 




10 










15 


Tyr 


Glu 


Arg 


Tyr 


Lys 


Asn 


Glu 




25 










30 


He 


Ala 


Arg 


Gin 


Ala 


Ala 


Lys 




40 










45 


Glu 


Arg 


Glu 


Arg 


Gly 


Gly 


Pro 




55 










60 


Pro 


Pro 


Ala 


He 


Gin 


Ser 


Leu 
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65 70 75 

lie Asn Leu Leu Ala Asp Asn Arg Tyr Leu Thr Ala Glu Glu Thr 

80 85 90 

Asp Lys lie lie Asn Tyr Leu Arg Glu Arg Lys Glu Arg Leu Met 

95 100 105 

Arg Ser Ser Thr Asp Ser Leu Pro Gly Glu Leu Arg Gly Arg Ala 

110 115 120 

Glu Ala Arg Phe Pro Ala Asn His Ser Gly Arg Pro Arg Val Pro 

125 130 135 

Arg 



<210> 3 

<211> 230 

<212> PRT 

<213> Homo sapiens 

<220> 

<22l> misc_f eature 

<223> Incyte Clone No. : 996352 

<400> 3 

Met Ser Ser Ser Tyr Tyr Val Asn Ala Leu Phe Ser Lys Tyr Thr 

15 10 15 

Ala Gly Ala Ser Leu Phe Gin Asn Ala Glu Pro Thr Ser Cys Ser 

20 25 30 

Phe Ala Pro Asn Ser Gin Arg Ser Gly Tyr Gly Ala Gly Ala Gly 

35 40 45 

Ala Phe Ala Ser Thr Val Pro Gly Leu Tyr Asn Val Asn Ser Pro 

50 55 60 

Leu Tyr Gin Ser Pro Phe Ala Ser Gly Tyr Gly Leu Gly Ala Asp 

65 70 75 

Ala Tyr Gly Asn Leu Pro Cys Ala Ser Tyr Asp Gin Asn lie Pro 

80 85 90 

Gly Leu Cys Ser Asp Leu Ala Lys Gly Ala Cys Asp Lys Thr Asp 

95 100 105 

Glu Gly Ala Leu His Gly Ala Ala Glu Ala Asn Phe Arg lie Tyr 

110 115 120 

Pro Trp Met Arg Ser Ser Gly Pro Asp Arg Lys Arg Gly Arg Gin 

125 130 135 

Thr Tyr Thr Arg Tyr Gin Thr Leu Glu Leu Glu Lys Glu Phe His 

140 145 150 

Phe Asn Arg Tyr Leu lie Arg Arg Arg Arg lie Glu lie Ala His 

155 160 165 

Ala Leu Cys Leu Thr Glu Arg Gin lie Lys lie Trp Phe Gin Asn 

170 175 180 

Arg Arg Met Lys Trp Lys Lys Glu His Lys Asp Glu Gly Pro Thr 

185 190 195 

Ala Ala Ala Ala Pro Glu Gly Ala Val Pro Ser Ala Ala Ala Thr 

200 205 210 

Ala Ala Ala Asp Lys Ala Asp Glu Glu Asp Asp Asp Glu Glu Glu 

215 220 225 
Glu Asp Glu Glu Glu 
230 
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<210> 4 

c211> 131 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Clone No. : 1273778 

<400> 4 

Met Ser Gin Val Thr Phe Ser Asp Val Ala He Asp Phe Ser His 

15 10 15 

Glu Glu Trp Ala Cys Leu Asp Ser Ala Gin Arg Asp Leu Tyr Lys 

20 25 , v 30 

Asp Val Met Val Gin Asn Tyr Glu Asn Leu Val Ser Val Gly Leu 

35 40 45 

Ser Val Thr Lys Pro Tyr Val He Met Leu Leu Glu Asp Gly Lys 

50 55 60 

Glu Pro Trp Met Met Glu Lys Lys Leu Ser Lys Ala Tyr Pro Phe 

65 70 75 

Pro Leu Ser His Ser Val Pro Ala Ser Val Asn Phe Gly Phe Ser 

80 85 90 

Ala Leu Phe Glu His Cys Ser Glu Val Thr Glu He Phe Glu Leu 

95 100 105 

Ser Glu Leu Cys Val Phe Trp Val Leu His Phe Leu Ser Asn Ser 

110 115 120 

Pro Asn Ser Thr Val Glu Ala Phe Phe Lys Lys 

125 130 



<210> 5 
<211> 411 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Clone No. : 1509715 

<400> 5 

Met Ser Lys Arg Pro Ser Tyr Ala Pro Pro Pro Thr Pro Ala Pro 

15 10 15 

Ala Thr Gin Met Pro Ser Thr Pro Gly Phe Val Gly Tyr Asn Pro 

20 25 30 

Tyr Ser His Leu Ala Tyr Asn Asn Tyr Arg Leu Gly Gly Asn Pro 

35 40 45 

Gly Thr Asn Ser Arg Val Thr Ala Ser Ser Gly He Thr He Pro 

50 55 60 

Lys Pro Pro Lys Pro Pro Asp Lys Pro Leu Met Pro Tyr Met Arg 

65 70 75 

Tyr Ser Arg Lys Val Trp Asp Gin Val Lys Ala Ser Asn Pro Asp 

80 85 90 
Leu Lys Leu Trp Glu He Gly Lys He He Gly Gly Met Trp Arg 
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95 100 105 

Asp Leu Thr Asp Glu Glu Lys Gin Glu Tyr Leu Asn Glu Tyr Glu 

110 115 120 

Ala Glu Lys lie Glu Tyr Asn Glu Ser Met Lys Ala Tyr His Asn 

125 130 135 

Ser Pro Ala Tyr Leu Ala Tyr lie Asn Ala Lys Ser Arg Ala Glu 

140 145 150 

Ala Ala Leu Glu Glu Glu Ser Arg Gin Arg Gin Ser Arg Met Glu 

155 160 165 

Lys Gly Glu Pro Tyr Met Ser lie Gin Pro Ala Glu Asp Pro Asp 

170 175 180 

Asp Tyr Asp Asp Gly Phe Ser Met Lys His Thr Ala Thr Ala Arg 

185 190 195 

Phe Gin Arg Asn His Arg Leu lie Ser Glu lie Leu Ser Glu Ser 

200 205 210 

Val Val Pro Asp Val Arg Ser Val Val Thr Thr Ala Arg Met Gin 

215 220 225 

Val Leu Lys Arg Gin Val Gin Ser Leu Met Val His Gin Arg Lys 

230 235 240 

Leu Glu Ala Glu Leu Leu Gin lie Glu Glu Arg His Gin Glu Lys 

245 250 255 

Lys Arg Lys Phe Leu Glu Ser Thr Asp Ser Phe Asn Asn Glu Leu 

260 265 270 

Lys Arg Leu Cys Gly Leu Lys Val Glu Val Asp Met Glu Lys lie 

275 280 285 

Ala Ala Glu lie Ala Gin Ala Glu Glu Gin Ala Arg Lys Arg Gin 

290 295 300 

Glu Glu Arg Glu Lys Glu Ala Ala Glu Gin Ala Glu Arg Ser Gin 

305 310 315 

Ser Ser lie Val Pro Glu Glu Glu Gin Ala Ala Asn Lys Gly Glu 

320 325 330 

Glu Lys Lys Asp Asp Glu Asn lie Pro Met Glu Thr Glu Glu Thr 

335 340 345 

His Leu Glu Glu Thr Thr Glu Ser Gin Gin Asn Gly Glu Glu Gly 

350 355 360 

Thr Ser Thr Pro Glu Asp Lys Glu Ser Gly Gin Glu Gly Val Asp 

365 370 375 

Ser Met Ala Glu Glu Gly Thr Ser Asp Ser Asn Thr Gly Ser Glu 

380 385 390 

Ser Asn Ser Ala Thr Val Glu Glu Pro Pro Thr Asp Pro lie Pro 

395 400 405 
Glu Asp Glu Lys Lys Glu 

410 



<210> 6 

<211> 226 

<212> PRT 

<213> Homo sapiens 

<220> 

<22 1> unsure 

<222> 221, 228 

<223> unknown or other 

<220> 
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<221> mi sc_ feature 

<223> Incyte Clone No.: 1676367 
<400> 6 

Met Ala Ala Lys Val Asp Leu Ser Thr Ser Thr Asp Trp Lys Glu 

15 10 15 

Ala Lys Ser Phe Leu Lys Gly Leu Ser Asp Lys Gin Arg Glu Glu 

20 25 30 

His Tyr Phe Cys Lys Asp Phe Val Arg Leu Lys Lys lie Pro Thr 

35 40 45 

Trp Lys Glu Met Ala Lys Gly Val Ala Val Lys Val Glu Glu Pro 

50 55 60 

Arg Tyr Lys Lys Asp Lys Gin Leu Asn Glu Lys lie Ser Leu Leu 

65 70 75 

Arg Ser Asp lie Thr Lys Leu Glu Val Asp Ala lie Val Asn Ala 

80 85 90 

Ala Asn Ser Ser Leu Leu Gly Gly Gly Gly Val Asp Gly Cys lie 

95 100 105 

His Arg Ala Ala Gly Pro Leu Leu Thr Asp Glu Cys Arg Thr Leu 

110 115 120 

Gin Ser Cys Lys Thr Gly Lys Ala Lys lie Thr Gly Gly Tyr Arg 

125 130 135 

Leu Pro Ala Lys Tyr Val lie His Thr Val Gly Pro lie Ala Tyr 

140 145 150 

Gly Glu Pro Ser Ala Ser Gin Ala Ala Glu Leu Arg Ser Cys Tyr 

155 160 165 

Leu Ser Ser Leu Asp Leu Leu Leu Glu His Arg Leu Arg Ser Val 

170 175 180 

Ala Phe Pro Cys lie Ser Thr Gly Val Phe Gly Tyr Pro Cys Glu 

185 190 195 

Ala Ala Ala Glu lie Val Leu Ala Thr Leu Arg Glu Trp Leu Gly 

200 205 210 

Ser Ser Thr Arg Glu Pro Arg Xaa Asn Leu Asn Phe Xaa Glu Pro 

215 220 225 

Gly 



<210> 7 
<211> 183 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Clone No.: 1734119 

<400> 7 

Met Gly Arg Leu Cys Cys Leu Arg Pro Pro Pro His Arg Asp Pro 

15 10 15 

Ala Arg Leu Leu Leu Ala Ser Thr Asp Asp Lys Arg Asn Ser Pro 

20 25 30 

Lys lie Arg Pro Leu Gin Pro Ala Val Pro Ala Cys Leu Pro Ala 

35 40 45 

Thr Val Arg Pro Ala Leu Ala Ser Ser Ser Ala Gly Leu Ser Ala 

50 55 60 
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Gly Phe Trp Gly Gin Lys Ser Gly Glu Pro Arg Gly Arg Val Arg 

65 70 75 

Gly Asp Gin Val Arg Ala Ala Thr Phe Leu Val lie Ser Pro Met 

80 85 90 

Gly Arg Arg Gly Trp Arg Asp Thr Ala Pro Pro Gly Phe Pro Thr 

95 100 105 

Pro Leu Leu Ser His Pro Glu Ala Ser Phe Phe Cys Ala Arg Cys 

110 115 120 

Leu Pro Lys Arg Val Gly Ala Arg Ser Pro Pro Trp Arg Val Leu 

125 130 135 

Gly Pro Gly Gly Ala Leu Gly Glu Gin Met Gly Pro Pro Leu Ala 

140 145 150 

Gly Pro Leu Gin Leu Phe Pro Ala Ala Glu Pro Ser Gly Gly Pro 

155 160 165 

Val Leu Val Ala Ser Leu Arg Ala Gin lie Ala Gin Gly Asp Leu 

170 175 180 

Ala Val Ala 



<210> 8 
<211> 317 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 1944813 

<400> 8 

Met Lys Ser Asp Cys Met Gin Thr Thr lie Cys Gin Glu Arg Lys 

15 10 15 

Lys Asp Pro lie Glu Met Phe His Ser Gly Gin Leu Val Lys Val 

20 25 30 

Cys Ala Pro Met Val Arg Tyr Ser Lys Leu Ala Phe Arg Thr Leu 

35 40 45 

Val Arg Lys Tyr Ser Cys Asp Leu Cys Tyr Thr Pro Met lie Val 

50 55 60 

Ala Ala Asp Phe Val Lys Ser lie Lys Ala Arg Asp Ser Glu Phe 

65 70 75 

Thr Thr Asn Gin Gly Asp Cys Pro Leu lie Val Gin Phe Ala Ala 

80 85 90 

Asn Asp Ala Arg Leu Leu Ser Asp Ala Ala Arg lie Val Cys Pro 

95 100 105 

Tyr Ala Asn Gly lie Asp lie Asn Cys Gly Cys Pro Gin Arg Trp 

110 115 120 

Ala Met Ala Glu Gly Tyr Gly Ala Cys Leu lie Asn Lys Pro Glu 

125 130 135 

Leu Val Gin Asp Met Val Lys Gin Val Arg Asn Gin Val Glu Thr 

140 145 150 

Pro Gly Phe Ser Val Ser lie Lys lie Arg lie His Asp Asp Leu 

155 160 165 

Lys Arg Thr Val Asp Leu Cys Gin Lys Ala Glu Ala Thr Gly Val 

170 175 180 

Ser Trp lie Thr Val His Gly Arg Thr Ala Glu Glu Arg His Gin 

185 190 195 
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Pro 


Val 


His 


Tyr 


Asp 
200 


Ser 


He 


Lys 


He 


He 
205 


Lys 


Glu 


Asn 


Met 


Ser 
210 


lie 


Pro 


Val 


He 


Ala 
215 


Asn 


Gly 


Asp 


He 


Arg 
220 


Ser 


Leu 


Lys 


Glu 


Ala 
225 


Glu 


Asn 


Val 


Trp 


Arg 

230 


He 


Thr 


Gly 


Thr 


Asp 
235 


Gly 


Val 


Met 


Val 


Ala 
240 


Arcr 


Glv 


Leu 


Leu 


Ala 
245 


Asn 


Pro 


Ala 


Met 


Phe 
250 


Ala 


Gly 


Tyr 


Glu 


Glu 
255 


Thr 


Pro 


Leu 


Lys 


Cys 
260 


He 


Trp 


Asp 


Trp 


Val 

265 


Asp 


He 


Ala 


Leu 


Glu 
270 


Leu 


Gly 


Thr 


Pro 


Tyr 
275 


Met 


Cys 


Phe 


His 


Gin 
280 


His 


Leu 


Met 


Tyr 


Met 
285 


Met 


Glu 


Lys 


He 


Thr 
290 


Ser 


Arg 


Gin 


Glu 


Lys 
295 


Arg 


Val 


Phe 


Asn 


Ala 
300 


Leu 


Ser 


Ser 


Thr 


Ser 
305 


Ala 


He 


He 


Asp 


Tyr 
310 


Leu 


Thr 


Asp 


His 


Tyr 

315 



Gly He 



<210> 9 
<211> 479 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 2683322 
<400> 9 

Met Ala Thr Asp Ser Trp Ala Leu Ala Val Asp Glu Gin Glu Ala 

15 10 15 

Ala Ala Glu Ser Leu Ser Asn Leu His Leu Lys Glu Glu Lys He 

20 25 30 

Lys Pro Asp Thr Asn Gly Ala Val Val Lys Thr Asn Ala Asn Ala 

35 40 45 

Glu Lys Thr Asp Glu Glu Glu Lys Glu Asp Arg Ala Ala Gin Ser 

50 55 60 

Leu Leu Asn Lys Leu He Arg Ser Asn Leu Val Asp Asn Thr Asn 

65 70 75 

Gin Val Glu Val Leu Gin Arg Asp Pro Asn Ser Pro Leu Tyr Ser 

80 85 90 

Val Lys Ser Phe Glu Glu Leu Arg Leu Lys Pro Gin Leu Leu Gin 

95 100 105 

Gly Val Tyr Ala Met Gly Phe Asn Arg Pro Ser Lys He Gin Glu 

110 115 120 

Asn Ala Leu Pro Met Met Leu Ala Glu Pro Pro Gin Asn Leu He 

125 130 135 

Ala Gin Ser Gin Ser Gly Thr Gly Lys Thr Ala Ala Phe Val Leu 

140 145 150 

Ala Met Leu Ser Arg Val Glu Pro Ser Asp Arg Tyr Pro Gin Cys 

155 160 165 

Leu Cys Leu Ser Pro Thr Tyr Glu Leu Ala Leu Gin Thr Gly Lys 

170 175 180 
Val He Glu Gin Met Gly Lys Phe Tyr Pro Glu Leu Lys Leu Ala 

185 190 195 
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Tyr Ala Val Arg Gly Asn Lys Leu Glu Arg Gly Gin Lys lie Ser 

200 205 210 

Glu Gin lie Val He Gly Thr Pro Gly Thr Val Leu Asp Trp Cys 

215 220 225 

Ser Lys Leu Lys Phe He Asp Pro Lys Lys He Lys Val Phe Val 

230 235 240 

Leu Asp Glu Ala Asp Val Met He Ala Thr Gin Gly His Gin Asp 

245 250 255 

Gin Ser He Arg He Gin Arg Met Leu Pro Arg Asn Cys Gin Met 

260 265 270 

Leu Leu Phe Ser Ala Thr Phe Glu Asp Ser Val Trp Lys Phe Ala 

275 280 285 

Gin Lys Val Val Pro Asp Pro Asn Val He Lys Leu Lys Arg Glu 

290 295 300 

Glu Glu Thr Leu Asp Thr He Lys Gin Tyr Tyr Val Leu Cys Ser 

305 310 315 

Ser Arg Asp Glu Lys Phe Gin Ala Leu Cys Asn Leu Tyr Gly Ala 

320 325 330 

He Thr He Ala Gin Ala Met lie Phe Cys His Thr Arg Lys Thr 

335 340 345 

Ala Ser Trp Leu Ala Ala Glu Leu Ser Lys Glu Gly His Gin Val 

350 355 360 

Ala Leu Leu Ser Gly Glu Met Met Val Glu Gin Arg Ala Ala Val 

365 370 375 

He Glu Arg Phe Arg Glu Gly Lys Glu Lys Val Leu Val Thr Thr 

380 385 390 

Asn Val Cys Ala Arg Gly He Asp Val Glu Gin Val Ser Val Val 

395 400 405 

He Asn Phe Asp Leu Pro Val Asp Lys Asp Gly Asn Pro Asp Asn 

410 415 420 

Glu Thr Tyr Leu His Arg He Gly Arg Thr Gly Arg Phe Gly Lys 

425 430 435 

Arg Gly Leu Ala Val Asn Met Val Asp Ser Lys His Ser Met Asn 

440 445 450 

lie Leu Asn Arg lie Gin Glu His Phe Asn Lys Lys He Glu Arg 

455 460 465 

Leu Asp Thr Asp Asp Leu Asp Glu He Glu Lys He Ala Asn 

470 475 



<210> 10 
<211> 582 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 2684552 

<400> 10 

Met Ala Glu Phe Leu Asp Asp Gin 

1 5 
Cys Lys Lys Glu He Pro Val Phe 
20 

His Cys Gin Arg Asn He Gly Met 



Glu Thr Arg Leu Cys Asp Asn 
10 15 

Asn Phe Thr He His Glu He 
25 30 

Cys Pro Thr Cys Lys Glu Pro 
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35 40 45 

Phe Pro Lys Ser Asp Met Glu Thr His Met Ala Ala Glu His Cys 
50 55 60 

Gin Val Thr Cys Lys Cys Asn Lys Lys Leu Glu Lys Arg Leu Leu 
65 70 75 

Lys Lys His Glu Glu Thr Glu Cys Pro Leu Arg Leu Ala Val Cys 
80 85 90 

Gin His Cys Asp Leu Glu Leu Ser lie Leu Lys Leu Lys Glu His 
95 100 105 

Glu Asp Tyr Cys Gly Ala Arg Thr Glu Leu Cys Gly Asn Cys Gly 
HO 115 120 

Arg Asn Val Leu Val Lys Asp Leu Lys Thr His Pro Glu Val Cys 
125 130 135 

Gly Arg Glu Gly Glu Glu Lys Arg Asn Glu Val Ala He Pro Pro 
140 145 150 

Asn Ala Tyr Asp Glu Ser Trp Gly Gin Asp Gly lie Trp He Ala 
155 160 165 

Ser Gin Leu Leu Arg Gin lie Glu Ala Leu Asp Pro Pro Met Arg 
170 175 180 

Leu Pro Arg Arg Pro Leu Arg Ala Phe Glu Ser Asp Val Phe His 
185 190 195 

Asn Arg Thr Thr Asn Gin Arg Asn He Thr Ala Gin Val Ser He 
200 205 210 

Gin Asn Asn Leu Phe Glu Glu Gin Glu Arg Gin Glu Arg Asn Arg 
215 220 225 

Gly Gin Gin Pro Pro Lys Glu Gly Gly Glu Glu Ser Ala Asn Leu 
230 235 240 

Asp Phe Met Leu Ala Leu Ser Leu Gin Asn Glu Gly Gin Ala Ser 
245 250 255 

Ser Val Ala Glu Gin Asp Phe Trp Arg Ala Val Cys Glu Ala Asp 
260 265 270 

Gin Ser His Gly Gly Pro Arg Ser Leu Ser Asp He Lys Gly Ala 
275 280 285 

Ala Asp Glu He Met Leu Pro Cys Glu Phe Cys Glu Glu Leu Tyr 
290 295 300 

Pro Glu Glu Leu Leu He Asp His Gin Thr Ser Cys Asn Pro Ser 
305 310 315 

Arg Ala Leu Pro Ser Leu Asn Thr Gly Ser Ser Ser Pro Arg Gly 
320 325 330 

Val Glu Glu Pro Asp Val lie Phe Gin Asn Phe Leu Gin Gin Ala 
335 340 345 

Ala Ser Asn Gin Leu Asp Ser Leu Met Gly Leu Ser Asn Ser His 
350 355 360 

Pro Val Glu Glu Ser He He He Pro Cys Glu Phe Cys Gly Val 
365 370 375 

Gin Leu Glu Glu Glu Val Leu Phe His His Gin Asp Gin Cys Asp 
380 385 390 

Gin Arg Pro Ala Thr Ala Thr Asn His Val Thr Glu Gly He Pro 
395 400 405 

Arg Leu Asp Ser Gin Pro Gin Glu Thr Ser Pro Glu Leu Pro Arg 
410 415 420 

Arg Arg Val Arg His Gin Gly Asp Leu Ser Ser Gly Tyr Leu Asp 
425 430 435 

Asp Thr Lys Gin Glu Thr Ala Asn Gly Pro Thr Ser Cys Leu Pro 
440 445 450 

Pro Ser Arg Pro He Asn Asn Met Thr Ala Thr Tyr Asn Gin Leu 
455 460 465 
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Ser Arg Ser Thr Ser Gly Pro Arg Pro Gly Cys Gin Pro Ser Ser 

470 475 480 

Pro Cys Val Pro Lys Leu Ser Asn Ser Asp Ser Gin Asp lie Gin 

485 490 495 

Gly Arg Asn Arg Asp Ser Gin Asn Gly Ala lie Ala Pro Gly His 

500 505 510 

Val Ser Val lie Arg Pro Pro Gin Asn Leu Tyr Pro Glu Asn lie 

515 520 525 

Val Pro Ser Phe Ser Pro Gly Pro Ser Gly Arg Tyr Gly Ala Ser 

530 535 540 

Gly Arg Ser Glu Gly Gly Arg Asn Ser Arg Val Thr Pro Ala Ala 

545 550 555 

Ala Asn Tyr Arg Ser Arg Thr Ala Lys Ala Lys Pro Ser Lys Gin 

560 565 570 

Gin Gly Ala Gly Asp Ala Glu Glu Glu Glu Glu Glu 

575 580 



<210> 11 
<211> 327 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 2830310 



<400> 11 
Met Arg Trp 
1 

Ser Ser Arg 
Glu Ser Arg 
Leu Ser Gin 
Ala Glu Gly 
Val lie Lys 
Lys Thr Lys 
Asp Ser Glu 
Leu Ser Gin 
Cys Arg Arg 
Leu Lys Glu 
Glu Val Gin 
Glu Ser Leu 
Lys Pro Asp 



Pro Gly His 

5 

His Phe Ser 
20 

Thr Gin Phe 
35 

Thr Ser Leu 
50 

Asp Glu Pro 
65 

Arg Asn Trp 
80 

lie Leu Gly 
95 

Asn Lys Phe 

110 
Asp Leu Leu 

125 
Pro Val Leu 

140 
lie Lys His 

155 
Glu Asp His 

170 
Gly Tyr His 

185 
Gly Cys Ala 



Tyr Ser Arg 
Leu Asn Trp 
Gin Tyr Cys 
lie His Leu 
Ser Ser Lys 
Glu Tyr lie 
Asp Lys Asn 
Asp Phe Ser 
Glu Asp Asn 
His Trp Ser 
Phe Asp Ala 
Tyr Gly Ala 
Cys Glu Tyr 
lie Cys Phe 



Ala Pro Tyr 
10 

Arg Pro Pro 
25 

Asn Trp Arg 
40 

Ser Ser Tyr 
55 

Arg Arg Lys 
70 

Cys Ser His 
85 

Val Asp Pro 
100 

Val Met Ser 
115 

Ser His Leu 
130 

Phe Arg Phe 
145 

Asp Val Leu 
160 

Glu lie Arg 
175 

Lys Met Arg 
190 

Lys His Ser 



Pro Tyr Phe 
15 

Cys Leu Phe 
30 

Pro Asp Asn 
45 

Val Met Asn 
60 

His Gin Gly 
75 

Asp Lys Glu 
90 

Lys Cys Glu 
105 

Tyr Asn lie 
120 

Tyr Arg His 
135 

Pro Asn lie 
150 

Cys Leu Gin 
165 

Pro Ser Leu 
180 

Thr Gly Arg 
195 

Lys Phe Ser 
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200 205 210 

Leu Leu Ser Val Asn Pro Val Glu Phe Phe Arg Pro Asp lie Ser 

215 220 225 

Leu Leu Asp Arg Asp Asn Val Gly Leu Val Leu Leu Leu Gin Pro 

230 235 240 

Lys lie Pro Tyr Ala Ala Cys Pro Ala lie Cys Val Ala Asn Thr 

245 250 255 

His Leu Leu Tyr Asn Pro Arg Arg Gly Asp lie Lys Leu Thr Gin 

260 265 270 

Leu Ala Met Leu Leu Ala Glu lie Ser Ser Val Ala His Gin Lys 

275 280 285 

Asp Gly Ser Phe Cys Pro lie Val Met Cys Gly Asp Phe Asn Ser 

290 295 300 

Val Pro Gly Ser Pro Leu Tyr Ser Phe He Lys Glu Gly Lys Leu 

305 310 315 

Asn Tyr Glu Gly Leu Pro He Gly Lys Thr Val lie 

320 325 



<210> 12 
<211> 502 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Clone No. : 2963346 
<400> 12 

Met Ala Ser Lys Lys Leu Gly Ala Asp Phe His Gly Thr Phe Ser 

15 10 15 

Tyr Leu Asp Asp Val Pro Phe Lys Thr Gly Asp Lys Phe Lys Thr 

20 25 30 

Pro Ala Lys Val Gly Leu Pro He Gly Phe Ser Leu Pro Asp Cys 

35 40 45 

Leu Gin Val Val Arg Glu Val Gin Tyr Asp Phe Ser Leu Glu Lys 

50 55 60 

Lys Thr lie Glu Trp Ala Glu Glu He Lys Lys He Glu Glu Ala 

65 70 75 

Glu Arg Glu Ala Glu Cys Lys He Ala Glu Ala Glu Ala Lys Val 

80 85 90 

Asn Ser Lys Ser Gly Pro Glu Gly Asp Ser Lys Met Ser Phe Ser 

95 100 105 

Lys Thr His Ser Thr Ala Thr Met Pro Pro Pro He Asn Pro He 

110 115 120 

Leu Ala Ser Leu Gin His Asn Ser He Leu Thr Pro Thr Arg Val 

125 130 135 

Ser Ser Ser Ala Thr Lys Gin Lys Val Leu Ser Pro Pro His He 

140 145 150 

Lys Ala Asp Phe Asn Leu Ala Asp Phe Glu Cys Glu Glu Asp Pro 

155 160 165 

Phe Asp Asn Leu Glu Leu Lys Thr He Asp Glu Lys Glu Glu Leu 

170 175 180 

Arg Asn He Leu Val Gly Thr Thr Gly Pro He Met Ala Gin Leu 

185 190 195 
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Leu Asp Asn Asn Leu Pro Arg Gly Gly Ser Gly Ser Val Leu Gin 

200 205 210 

Asp Glu Glu Val Leu Ala Ser Leu Glu Arg Ala Thr Leu Asp Phe 

215 220 225 

Lys Pro Leu His Lys Pro Asn Gly Phe lie Thr Leu Pro Gin Leu 

230 235 240 

Gly Asn Cys Glu Lys Met Ser Leu Ser Ser Lys Val Ser Leu Pro 

245 250 255 

Pro lie Pro Ala Val Ser Asn lie Lys Ser Leu Ser Phe Pro Lys 

260 265 270 

Leu Asp Ser Asp Asp Ser Asn Gin Lys Thr Ala Lys Leu Ala Ser 

275 280 285 

Thr Phe His Ser Thr Ser Cys Leu Arg Asn Gly Thr Phe Gin Asn 

290 295 300 

Ser Leu Lys Pro Ser Thr Gin Ser Ser Ala Ser Glu Leu Asn Gly 

305 310 315 

His His Thr Leu Gly Leu Ser Ala Leu Asn Leu Asp Ser Gly Thr 

320 325 330 

Glu Met Pro Ala Leu Thr Ser Ser Gin Met Pro Ser Leu Ser Val 

335 340 345 

Leu Ser Val Cys Thr Glu Glu Ser Ser Pro Pro Asn Thr Gly Pro 

350 355 360 

Thr Val Thr Pro Pro Asn Phe Ser Val Ser Gin Val Pro Asn Met 

365 370 375 

Pro Ser Cys Pro Gin Ala Tyr Ser Glu Leu Gin Met Leu Ser Pro 

380 385 390 

Ser Glu Arg Gin Cys Val Glu Thr Val Val Asn Met Gly Tyr Ser 

395 400 405 

Tyr Glu Cys Val Leu Arg Ala Met Lys Lys Lys Gly Glu Asn lie 

410 415 420 

Glu Gin lie Leu Asp Tyr Leu Phe Ala His Gly Gin Leu Cys Glu 

425 430 435 

Lys Gly Phe Asp Pro Leu Leu Val Glu Glu Ala Leu Glu Met His 

440 445 450 

Gin Cys Ser Glu Glu Lys Met Met Glu Phe Leu Gin Leu Met Ser 

455 460 465 

Lys Phe Lys Glu Met Gly Phe Glu Leu Lys Asp lie Lys Glu Val 

470 475 480 

Leu Leu Leu His Asn Asn Asp Gin Asp Asn Ala Leu Glu Asp Leu 

485 490 495 
Met Ala Arg Ala Gly Ala Ser 

500 



<210> 13 
<211> 375 
<212> PRT 

<213> Homo sapiens 
<:220> 

<221> misc_feature 

<223> Incyte Clone No. : 2994234 

<400> 13 

Met Leu Trp Lys Leu Leu Leu Arg Ser Gin Ser Cys Arg Leu Cys 
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l 



5 



10 



15 



Ser Phe Arg Lys Met Arg Ser Pro Pro Lys Tyr Arg Pro Phe Leu 

20 25 30 

Ala Cys Phe Thr Tyr Thr Thr Asp Lys Gin Ser Ser Lys Glu Asn 

35 40 45 

Thr Arg Thr Val Glu Lys Leu Tyr Lys Cys Ser Val Asp lie Arg 

50 55 60 

Lys lie Arg Arg Leu Lys Gly Trp Val Leu Leu Glu Asp Glu Thr 

65 70 75 

Tyr Val Glu Glu lie Ala Asn lie Leu Gin Glu Leu Gly Ala Asp 

80 85 90 

Glu Thr Ala Val Ala Ser lie Leu Glu Arg Cys Pro Glu Ala lie 

95 100 105 

Val Cys Ser Pro Thr Ala Val Asn Thr Gin Arg Lys Leu Trp Gin 

110 115 120 

Leu Val Cys Lys Asn Glu Glu Glu Leu lie Lys Leu lie Glu Gin 

125 130 135 

Phe Pro Glu Ser Phe Phe Thr lie Lys Asp Gin Glu Asn Gin Lys 

140 145 150 

Leu Asn Val Gin Phe Phe Gin Glu Leu Gly Leu Lys Asn Val Val 

155 160 165 

lie Ser Arg Leu Leu Thr Ala Ala Pro Asn Val Phe His Asn Pro 

170 175 180 

Val Glu Lys Asn Lys Gin Met Val Arg lie Leu Gin Glu Ser Tyr 

185 190 195 

Leu Asp Val Gly Gly Ser Glu Ala Asn Met Lys Val Trp Leu Leu 

200 205 210 

Lys Leu Leu Ser Gin Asn Pro Phe lie Leu Leu Asn Ser Pro Thr 

215 220 225 

Ala lie Lys Glu Thr Leu Glu Phe Leu Gin Glu Gin Gly Phe Thr 

230 235 240 

Ser Phe Glu lie Leu Gin Leu Leu Ser Lys Leu Lys Gly Phe Leu 

245 250 255 

Phe Gin Leu Cys Pro Arg Ser lie Gin Asn Ser lie Ser Phe Ser 

260 265 270 

Lys Asn Ala Phe Lys Cys Thr Asp His Asp Leu Lys Gin Leu Val 

275 280 285 

Leu Lys Cys Pro Ala Leu Leu Tyr Tyr Ser Val Pro Val Leu Glu 

290 295 300 

Glu Arg Met Gin Gly Leu Leu Arg Glu Gly lie Ser lie Ala Gin 

305 310 315 

lie Arg Glu Thr Pro Met Val Leu Glu Leu Thr Pro Gin lie Val 

320 325 330 

Gin Tyr Arg lie Arg Lys Leu Asn Ser Ser Gly Tyr Arg lie Lys 

335 340 345 

Asp Gly His Leu Ala Asn Leu Asn Gly Ser Lys Lys Glu Phe Glu 

350 355 360 

Ala Asn Phe Gly Lys He Gin Ala Lys Lys Ser Lys Ala He lie 



365 



370 



375 



<210> 14 

<211> 341 

<212> PRT 

<213> Homo 



sapiens 
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<220> 

<221> m±sc_f eature 

<223> Incyte Clone No.: 4115958 

<400> 14 

Met His Asp Ser Ser Ser Val Ala Ser Lys Val Phe Arg Ser Ser 

15 10 15 

Tyr Glu Asp Lys Asn Leu Leu Lys Lys Asn Lys Asp Glu Ser Ser 

20 25 30 

Val Ser He Ser His Thr Lys Cys Ser Leu Leu Gly Asp He Ser 

35 40 45 

Asp Gly Lys Asn Leu He Pro Asn Lys Cys Phe Thr Ser Phe Lys 

50 55 60 

Asn Asn Ser Lys Glu Lys Cys Ser Leu Lys His Gin Thr Arg Asn 

65 70 75 

Gin Cys Gin Asn Asn Pro Ser Glu lie lie Gin Ser Thr Tyr Gin 

80 65 90 

Glu Thr Gin Asn Lys Ser Ser Ser Leu Ser Thr Ser Ser He Leu 

95 100 105 

Ser Gin His Lys Glu Asn Asn Leu Asp Leu Thr Ser Arg Phe Lys 

110 115 120 

Glu Gin Glu Met Ser Asn Gly lie Asp Lys Gin Tyr Ser Asn Cys 

125 130 135 

Thr Thr He Asp Lys Gin He Cys Thr Asn Lys Tyr Lys Glu Lys 

140 145 150 

He He Asn Glu Asn Tyr Asn Pro Lys Phe Phe Gly Asn Leu Gin 

155 160 165 

Ser Asp Asp Ser Lys Lys Asn Asp Ser Lys He Lys Val Thr Val 

170 175 180 

Leu Glu Met Ser Glu Tyr Leu Asn Lys Tyr Glu Ser Met Ser Ser 

185 190 195 

Asn Lys Asp Ser Lys Arg Pro Lys Thr Cys Glu Gin Asn Thr Gin 

200 205 210 

Leu Asn Ser lie Glu Asn Tyr Leu A.sn Lys Asp Asn Glu Gly Phe 

215 220 225 

Lys Cys Lys Lys Ser Asp Gin Leu Lys Asn Glu Gin Asp Lys Gin 

230 235 240 

Glu Asp Pro Thr Asn Glu Lys Ser Gin Asn Tyr Ser Gin Arg Arg 

245 250 255 

Ser He Lys Asp Cys Leu Ser Thr Cys Glu Gin Pro Lys Asn Thr 

260 265 270 

Glu Val Leu Arg Thr Thr Leu Lys His Ser Asn Val Trp Arg Lys 

275 280 285 

His Asn Phe His Ser Leu Asp Gly Thr Ser Thr Arg Ala Phe His 

290 295 300 

Pro Gin Thr Gly Leu Pro Leu Leu Ser Ser Pro Val Pro Gin Arg 

305 310 315 

Lys Thr Gin Ser Gly Cys Phe Asp Leu Asp Ser Ser Leu Leu His 

320 325 330 

Leu Lys Ser Phe Ser Ser Arg Arg Asn Leu Ser 

335 340 



<210> 15 
<211> 269 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 779255 

<400> 15 

Met His Thr Glu Thr lie Lys Pro His Lys Cys Pro His Cys Ser 

15 10 15 

Lys Thr Phe Ala Asn Thr Ser Tyr Leu Ala Gin His Leu Arg lie 

20 25 30 

His Ser Gly Ala Lys Pro Tyr Asn Cys Ser Tyr Cys Gin Lys Ala 

35 40 45 

Phe Arg Gin Leu Ser His Leu Gin Gin His Thr Arg lie His Thr 

50 55 60 

Gly Asp Arg Pro Tyr Lys Cys Ala His Pro Gly Cys Glu Lys Ala 

65 70 75 

Phe Thr Gin Leu Ser Asn Leu Gin Ser His Arg Arg Gin His Asn 

80 85 90 

Lys Asp Lys Pro Phe Lys Cys His Asn Cys His Arg Ala Tyr Thr 

95 100 105 

Asp Ala Ala Ser Leu Glu Val His Leu Ser Thr His Thr Val Lys 

110 115 120 

His Ala Lys Val Tyr Thr Cys Thr lie Cys Ser Arg Ala Tyr Thr 

125 130 135 

Ser Glu Thr Tyr Leu Met Lys His Met Arg Lys His Asn Pro Pro 

140 145 150 

Asp Leu Gin Gin Gin Val Gin Ala Ala Ala Ala Ala Ala Ala Val 

155 160 165 

Ala Gin Ala Gin Ala Gin Ala Gin Ala Gin Ala Gin Ala Gin Ala 

170 175 180 

Gin Ala Gin Ala Gin Ala Gin Ala Ser Gin Ala Ser Gin Gin Gin 

185 190 195 
Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Pro Pro 

200 205 210 
Pro His Phe Gin Ser Pro Gly Ala Ala Pro Gin Gly Gly Gly Gly 

215 220 225 

Gly Asp Ser Asn Pro Asn Pro Pro Pro Gin Cys Ser Phe Asp Leu 

230 235 240 
Thr Pro Tyr Lys Thr Ala Glu His His Lys Asp lie Cys Leu Thr 

245 250 255 
Val Thr Thr Ser Thr lie Gin Val Glu His Leu Ala Ser Ser 

260 265 



<210> 16 

<211> 264 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 1303605 
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<400> 16 

Met Glu Asn Tyr Gly lie Glu Trp His Ser Val Arg Asp Ser Glu 
15 10 15 

Gly Gin Lys Leu Leu lie Gly Val Gly Pro Glu Gly lie Ser lie 
20 25 30 

Cys Lys Asp Asp Phe Ser Pro lie Asn Arg lie Ala Tyr Pro Val 
35 40 45 

Val Gin Met Ala Thr Gin Ser Gly Lys Asn Val Tyr Leu Thr Val 
50 55 60 

Thr Lys Glu Ser Gly Asn Ser lie Val Leu Leu Phe Lys Met lie 
65 70 75 

Ser Thr Arg Ala Ala Ser Gly Leu Tyr Arg Ala lie Thr Glu Thr 
80 85 90 

His Ala Phe Tyr Arg Cys Asp Thr Val Thr Ser Ala Val Met Met 
95 100 105 

Gin Tyr Ser Arg Asp Leu Lys Gly His Leu Ala Ser Leu Phe Leu 

110 115 120 

Asn Glu Asn lie Asn Leu Gly Lys Lys Tyr Val Phe Asp lie Lys 

125 130 135 

Arg Thr Ser Lys Glu Val Tyr Asp His Ala Arg Arg Ala Leu Tyr 

140 145 150 

Asn Ala Gly Val Val Asp Leu Val Ser Arg Ser Asn Gin Ser Pro 

155 160 165 

Ser His Ser Pro Leu Lys Ser Ser Glu Ser Ser Met Asn Cys Ser 

170 175 180 

Ser Cys Glu Gly Leu Ser Cys Gin Gin Thr Arg Val Leu Gin Glu 

185 190 195 

Lys Leu Arg Lys Leu Lys Glu Ala Met Leu Cys Met Val Cys Cys 

200 205 210 

Glu Glu Glu lie Asn Ser Thr Phe Cys Pro Cys Gly His Thr Val 

215 220 225 

Cys Cys Glu Ser Cys Ala Ala Gin Leu Gin Ser Cys Pro Val Cys 

230 235 240 

Arg Ser Arg Val Glu His Val Gin His Val Tyr Leu Pro Thr His 

245 250 255 

Thr Ser Leu Leu Asn Leu Thr Val lie 

260 



<210> 17 
<211> 605 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 1611167 

<400> 17 

Met Ala Ala Gly Gin Arg Glu Ala Arg Pro Gin Val Ser Leu Thr 

15 10 15 

Phe Glu Asp Val Ala Val Leu Phe Thr Arg Asp Glu Trp Arg Lys 

20 25 30 

Leu Ala Pro Ser Gin Arg Asn Leu Tyr Arg Asp Val Met Leu Glu 

35 40 45 
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Asn Tyr Arg Asn Leu Val Ser Leu Gly Leu Pro Phe Thr Lys Pro 
50 55 60 

Lys Val lie Ser Leu Leu Gin Gin Gly Glu Asp Pro Trp Glu Val 
65 70 75 

Glu Lys Asp Gly Ser Gly Val Ser Ser Leu Gly Ser Lys Ser Ser 
80 85 90 

His Lys Thr Thr Lys Ser Thr Gin Thr Gin Asp Ser Ser Phe Gin 
95 100 105 

Gly Leu lie Leu Lys Arg Ser Asn Arg Asn Val Pro Trp Asp Leu 

110 115 120 

Lys Leu Glu Lys Pro Tyr lie Tyr Glu Gly Arg Leu Glu Lys Lys 

125 130 135 

Gin Asp Lys Lys Gly Ser Phe Gin lie Val Ser Ala Thr His Lys 

140 145 150 

Lys lie Pro Thr lie Glu Arg Ser His Lys Asn Thr Glu Leu Ser 

155 160 165 

Gin Asn Phe Ser Pro Lys Ser Val Leu lie Arg Gin Gin lie Leu 

170 175 180 

Pro Arg Glu Lys Thr Pro Pro Lys Cys Glu lie Gin Gly Asn Ser 

185 190 195 

Leu Lys Gin Asn Ser Gin Leu Leu Asn Gin Pro Lys lie Thr Ala 

200 205 210 

Asp Lys Arg Tyr Lys Cys Ser Leu Cys Glu Lys Thr Phe lie Asn 

215 220 225 

Thr Ser Ser Leu Arg Lys His Glu Lys Asn His Ser Gly Glu Lys 

230 235 240 

Leu Phe Lys Cys Lys Glu Cys Ser Lys Ala Phe Ser Gin Ser Ser 

245 250 255 

Ala Leu lie Gin His Gin lie Thr His Thr Gly Glu Lys Pro Tyr 

260 265 270 

lie Cys Lys Glu Cys Gly Lys Ala Phe Thr Leu Ser Thr Ser Leu 

275 280 285 

Tyr Lys His Leu Arg Thr His Thr Val Glu Lys Ser Tyr Arg Cys 

290 295 300 

Lys Glu Cys Gly Lys Ser Phe Ser Arg Arg Ser Gly Leu Phe lie 

305 310 315 

His Gin Lys lie His Ala Glu Glu Asn Pro Cys Lys Tyr Asn Pro 

320 325 330 

Gly Arg Lys Ala Ser Ser Cys Ser Thr Ser Leu Ser Gly Cys Gin 

335 340 345 

Arg lie His Ser Arg Lys Lys Ser Tyr Leu Cys Asn Glu Cys Gly 

350 355 360 

Asn Thr Phe Lys Ser Ser Ser Ser Leu Arg Tyr His Gin Arg lie 

365 370 375 

His Thr Gly Glu Lys Pro Phe Lys Cys Ser Glu Cys Gly Arg Ala 

380 385 390 

Phe Ser Gin Ser Ala Ser Leu lie Gin His Glu Arg lie His Thr 

395 400 405 

Gly Glu Lys Pro Tyr Arg Cys Asn Glu Cys Gly Lys Gly Phe Thr 

410 415 420 

Ser lie Ser Arg Leu Asn Arg His Arg lie lie His Thr Gly Glu 

425 430 435 

Lys Phe Tyr Asn Cys Asn Glu Cys Gly Lys Ala Leu Ser Ser His 

440 445 450 

Ser Thr Leu lie lie His Glu Arg lie His Thr Gly Glu Lys Pro 

455 460 465 

Cys Lys Cys Lys Val Cys Gly Lys Ala Phe Arg Gin Ser Ser Ala 
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470 










475 










480 


Leu 


lie 


Gin 


His 


Gin 
485 


Arg 


Met 


His 


Thr 


Gly 
490 


Glu 


Arg 


Pro 


Tyr 


Lys 
495 


Cvs 


Asn 


Glu 


Cys 


Gly 
500 


Lys 


Thr 


Phe 


Arg 


Cys 
505 


Asn 


Ser 


Ser 


Leu 


Ser 
510 


Asn 


His 


Gin 


Arg 


He 
515 


His 


Thr 


Gly 


Glu 


Lys 
520 


Pro 


Tyr 


Arg 


Cys 


Glu 
525 


Glu 


Cys 


Gly 


He 


Ser 


Phe 


Gly 


Gin 


Ser 


Ser 


Ala 


Leu 


He 


Gin 


His 








530 










535 










540 


Arn 


Arcr 


lie 


His 


Thr 
545 


Gly 


Glu 


Lys 


Pro 


Phe 
550 


Lys 


Cys 


Asn 


Thr 


Cys 
555 


Gly 


Lys 


Thr 


Phe 


Arg 
560 


Gin 


Ser 


Ser 


Ser 


Arg 

565 


lie 


Ala 


His 


Gin 


Arg 
570 


lie 


His 


Thr 


Gly 


Glu 
575 


Lys 


Pro 


Tyr 


Glu 


Cys 
580 


Asn 


Thr 


Cys 


Gly 


Lys 
585 


Leu 


Phe 


Asn 


His 


Arg 
590 


Ser 


Ser 


Leu 


Thr 


Asn 
595 


His 


Tyr 


Lys 


He 


His 
600 


lie 


Glu 


Glu 


Asp 


Pro 
605 























<210> 18 
<211> 757 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Clone No.: 1907472 

<400> 18 

Met Gin Ser Ser Pro Asn Gly Gin Phe Val Ala Pro Ser Asp He 
15 10 15 

Gin Leu Lys Cys Asn Tyr Cys Lys Asn Ser Phe Cys Ser Lys Pro 
20 25 30 

Glu lie Leu Glu Trp Glu Asn Lys Val His Gin Phe Cys Ser Lys 
35 40 45 

Thr Cys Ser Asp Asp Tyr Lys Lys Leu His Cys He Val Thr Tyr 
50 55 60 

Cys Glu Tyr Cys Gin Glu Glu Lys Thr Leu His Glu Thr Val Asn 
65 70 75 

Phe Ser Gly Val Lys Arg Pro Phe Cys Ser Glu Gly Cys Lys Leu 
80 85 90 

Leu Tyr Lys Gin Asp Phe Ala Arg Arg Leu Gly Leu Arg Cys Val 
95 100 105 

Thr Cys Asn Tyr Cys Ser Gin Leu Cys Lys Lys Gly Ala Thr Lys 
110 US 120 

Glu Leu Asp Gly Val Val Arg Asp Phe Cys Ser Glu Asp Cys Cys 
125 130 135 

Lys Lys Phe Gin Asp Trp Tyr Tyr Lys Ala Ala Arg Cys Asp Cys 
140 145 150 

Cys Lys Ser Gin Gly Thr Leu Lys Glu Arg Val Gin Trp Arg Gly 
155 160 165 

Glu Met Lys His Phe Cys Asp Gin His Cys Leu Leu Arg Phe Tyr 
170 175 180 
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Cys 


Gin 


Gin 


Asn 


Glu 


Pro 


Asn 


Met 


Thr 


Thr 


Gin 


Lys 


Gly 


Pro 


Glu 








185 










190 










195 


Asn 


Leu. 


His 


Tyr 


Asp 


Gin 


Gly 


Cys 


Gin 


Thr 


Ser 


Arg 


Thr 


Lys 


Met 










200 










205 










210 


Thr 


Gly 


Ser 


Ala 


Pro 


Pro 


Pro 


Ser 


Pro 


Thr 


Pro 


Asn 


Lys 


Glu 


Met 








215 










220 










225 


Lys 


Asn 


Lys 


Ala 


Val 


Leu 


Cys 


Lys 


Pro 


Leu 


Thr 


Met 


Thr 


Lys 


Ala 










230 










235 










240 


Thr 


Tyr 


Cys 


Lys 


Pro 


His 


Met 


Gin 


Thr 


Lys 


Ser 


Cys 


Gin 


Thr 


Asp 










245 










250 










255 


Asp 


Thr 


Trp 


Arg 


Thr 


Glu 


Tyr 


Val 


Pro 


Val 


Pro 


He 


Pro 


Val 


Pro 










260 










265 










270 


Val 


Tvr 


He 


Pro 


Val 


Pro 


Met 


His 


Met 


Tyr 


Ser 


Gin 


Asn 


He 


Pro 








275 










280 










285 


Val 


Pro 


Thr 


Thr 


Val 


Pro 


Val 


Pro 


Val 


Pro 


Val 


Pro 


Val 


Phe 


Leu 










290 










295 










300 


Pro 


Ala 


Pro 


Leu 


Asp 


Ser 


Ser 


Glu 


Lys 


He 


Pro 


Ala 


Ala 


He 


Glu 










305 










310 










315 


Glu 


Leu 


Lys 


Ser 


Lys 


Val 


Ser 


Ser 


Asp 


Ala 


Leu 


Asp 


Thr 


Glu 


Leu 










320 










325 










330 


Leu 


Thr 


Met 


Thr 


Asp 


Met 


Met 


Ser 


Glu 


Asp 


Glu 


Gly 


Lys 


Thr 


Glu 










335 










340 










345 


Thr 


Thr 


Asn 


He 


Asn 


Ser 


Val 


He 


He 


Glu 


Thr 


Asp 


He 


He 


Gly 










350 










355 










360 


Ser 


Asp 


Leu 


Leu 


Lvs 


Asn 


Ser 


Asp 


Pro 


Glu 


Thr 


Gin 


Ser 


Ser 


Met 










365 










370 










375 


Pro 


Asp 


Val 


Pro 


Tyr 


Glu 


Pro 


Asp 


Leu 


Asp 


lie 


Glu 


He 


Asp 


Phe 








380 










385 










390 


Pro 


Arcr 


Ala 


Ala 


Glu 


Glu 


Leu 


Asp 


Met 


Glu 


Asn 


Glu 


Phe 


Leu 


Leu 










395 










400 










405 


Pro 


Pro 


Val 


Phe 


Gly 


Glu 


Glu 


Tyr 


Glu 


Glu 


Gin 


Pro 


Arg 


Pro 


Arg 










410 










415 










420 


Ser 


Lys 


Lys 


Lys 


Gly 


Ala 


Lys 


Arg 


Lys 


Ala 


Val 


Ser 


Gly 


Tyr 


Gin 










425 










430 










435 


Ser 


His 


Asp 


Asp 


Ser 


Ser 


Asp 


Asn 


Ser 


Glu 


Cys 


Ser 


Phe 


Pro 


Phe 










440 










445 










450 


Lys 


Tvr 


Thr 


Tyr 


Gly 


Val 


Asn 


Ala 


Trp 


Lys 


His 


Trp 


Val 


Lys 


Thr 










455 










460 










465 


Arg 


Gin 


Leu 


Asp 


Glu 


Asp 


Leu 


Leu 


Val 


Leu 


Asp 


Glu 


Leu 


Lys 


Ser 








470 










475 










480 


Ser 


Lys 


Ser 


Val 


Lys 


Leu 


Lys 


Glu 


Asp 


Leu 


Leu 


Ser 


His 


Thr 


Thr 










485 










490 










495 


Ala 


Glu 


Leu 


Asn 


Tyr 


Gly 


Leu 


Ala 


His 


Phe 


Val 


Asn 


Glu 


He 


Arg 










500 










505 










510 


Arcr 


Pro 


Asn 


Gly 


Glu 


Asn 


Tyr 


Ala 


Pro 


Asp 


Ser 


He 


Tyr 


Tyr 


Leu 








515 










520 










525 


Cys 


Leu 


Gly 


He 


Gin 


Glu 


Tyr 


Leu 


Cys 


Gly 


Ser 


Asn 


Arg 


Lys 


Asp 










530 










535 










540 


Asn 


lie 


Phe 


He 


Asp 


Pro 


Gly 


Tyr 


Gin 


Thr 


Phe 


Glu 


Gin 


Glu 


Leu 










545 










550 










555 


Asn 


Lys 


He 


Leu 


Arg 


Ser 


Trp 


Gin 


Pro 


Ser 


He 


Leu 


Pro 


Asp 


Gly 










560 










565 










570 


Ser 


He 


Phe 


Ser 


Arg 


Val 


Glu 


Glu 


Asp 


Tyr 


Leu 


Trp 


Arg 


He 


Lys 










575 










580 










585 


Gin 


Leu 


Gly 


Ser 


His 


Ser 


Pro 


Val 


Ala 


Leu 


Leu 


Asn 


Thr 


Leu 


Phe 










590 










595 










600 


Tyr 


Phe 


Asn 


Thr 


Lys 


Tyr 


Phe 


Gly 


Leu 


Lys 


Thr 


Val 


Glu 


Gin 


His 
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605 610 615 

Leu Arg Leu Ser Phe Gly Thr Val Phe Arg His Trp Lys Lys Asn 

620 625 630 

Pro Leu Thr Met Glu Asn Lys Ala Cys Leu Arg Tyr Gin Val Ser 

635 640 645 

Ser Leu Cys Gly Thr Asp Asn Glu Asp Lys lie Thr Thr Gly Lys 

650 655 660 

Arg Lys His Glu Asp Asp Glu Pro Val Phe Glu Gin lie Glu Asn 

665 670 675 

Thr Ala Asn Pro Ser Arg Cys Pro Val Lys Met Phe Glu Cys Tyr 

680 685 690 

Leu Ser Lys Ser Pro Gin Asn Leu Asn Gin Arg Met Asp Val Phe 

695 700 705 

Tyr Leu Gin Pro Glu Cys Ser Ser Ser Thr Asp Ser Pro Val Trp 

710 715 720 

Tyr Thr Ser Thr Ser Leu Asp Arg Asn Thr Leu Glu Asn Met Leu 

725 730 735 

Val Arg Val Leu Leu Val Lys Asp lie Tyr Asp Lys Asp Asn Tyr 

740 745 750 
Glu Leu Asp Glu Asp Thr Asp 

755 



<210> 19 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 1985458 

<400> 19 

Met Val Glu Lys Lys Thr Ser Val Arg Ser Gin Asp Pro Gly Gin 

1 5 10 ' 15 

Arg Arg Val Leu Asp Arg Ala Ala Arg Gin Arg Arg lie Asn Arg 

20 25 30 

Gin Leu Glu Ala Leu Glu Asn Asp Asn Phe Gin Asp Asp Pro His 

35 40 45 

Ala Gly Leu Pro Gin Leu Gly Lys Arg Leu Pro Gin Phe Asp Asp 

50 55 60 

Asp Ala Asp Thr Gly Lys Lys Lys Lys Lys Thr Arg Gly Asp His 

65 70 75 

Phe Lys Leu Arg Phe Arg Lys Asn Phe Gin Ala Leu Leu Glu Glu 

80 85 90 

Gin Asn Leu Ser Val Ala Glu Gly Pro Asn Tyr Leu Thr Ala Cys 

95 100 105 

Ala Gly Pro Pro Ser Arg Pro Gin Arg Pro Phe Cys Ala Val Cys 

110 H5 120 

Gly Phe Pro Ser Pro Tyr Thr Cys Val Ser Cys Gly Ala Arg Tyr 

125 130 135 

Cys Thr Val Arg Cys Leu Gly Thr His Gin Glu Thr Arg Cys Leu 

140 145 150 

Lys Trp Thr Val 
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<210> 20 
<211> 587 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 2726431 



<400> 20 
Met Asp Ser 
1 

Glu Glu Trp 
Asp Val Met 
lie Trp Asp 
Arg Asn Leu 
Asn Asp Gin 
Leu Tyr Lys 
Thr Tyr Gly 
Pro lie Thr 
Cys Gly Gin 
Arg Thr His 
Lys Thr Phe 
His Thr Ala 
Phe lie Asp 
Gly Glu Lys 
Tyr Ser Ser 
Lys Pro Tyr 
Ser Thr Phe 
His Lys Cys 
Phe Arg Arg 
Cys Lys Gin 
Arg His Glu 
Gin Cys Gly 



Val Val Phe 
5 

Ala Leu Leu 
20 

Leu Glu Thr 
35 

Ser Leu Ser 
50 

Ser Arg Asn 
65 

Cys Gly Glu 
80 

Lys lie Pro 
95 

Lys Val Phe 

110 
Val His Thr 

125 
Ala Tyr Ser 

140 
Asn Gly Glu 

155 

Pro Arg Thr 
170 

Glu Lys Thr 

185 
Phe Ser Ser 

200 
Pro Tyr Lys 

215 
Thr Phe Arg 

230 
Lys Cys Lys 

245 
Arg Arg His 

260 
Lys Glu Cys 

275 
His Met lie 

290 
Cys Gly Lys 

305 
Arg lie His 

320 
Lys Thr Phe 



Glu Asp Val 
Asp Ser Ala 
Phe Gin Asn 
lie Glu Asp 
His Gly Leu 
Ala Leu Ser 
Pro Gly Val 
Met His Arg 
Gly His Lys 
Cys Arg Ser 
Arg Pro Tyr 
Ser Ser Leu 
Tyr Glu Cys 
Leu Thr Ser 
Cys Lys Glu 
Arg His Thr 
Glu Cys Ala 
Met lie Ser 
Gly Glu Ala 
Thr His Thr 
Thr Phe lie 
Thr Gly Glu 
lie Tyr Pro 



Ala Val Asp 
10 

Gin Arg Asp 
25 

Leu Ala Ser 
40 

Gin Thr Thr 
55 

Glu Arg Leu 
70 

Gin lie Pro 
85 

Lys Gin Tyr 
100 

Arg Thr Ser 
115 

Pro Tyr Gin 
130 

His Leu Arg 
145 

Val Cys Lys 
160 

Asn Arg His 
175 

Lys Gin Cys 
190 

His Leu Arg 
205 

Cys Gly Lys 
220 

lie Thr His 
235 

Glu Ala Phe 
250 

His Thr Gly 
265 

Phe Ser Tyr 
280 

Gly Glu Lys 
295 

Tyr Leu Gin 
310 

Lys Pro Tyr 
325 

Gin Ser Phe 



Phe Thr Leu 
15 

Leu Tyr Arg 
30 

Val Gly Lys 
45 

Asn Gin Gly 
60 

Cys Glu Ser 
75 

His Leu Asn 
90 

Glu Tyr Asn 
105 

Leu Lys Ser 
120 

Cys Gin Glu 
135 

Met His Val 
150 

Leu Cys Gly 
165 

Val Arg lie 
180 

Gly Lys Ala 
195 

Ser His Thr 
210 

Ala Phe Ser 
225 

Thr Gly Glu 
240 

Ser Tyr Ser 
255 

Glu Lys Pro 
270 

Ser Ser Ala 
285 

Pro Tyr Glu 
300 

Ser Phe Arg 
315 

Glu Cys Lys 
330 

Arg Arg His 
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335 










340 










345 


ulll 


Arg 


j. ni 


His 


Gly 
350 


Gly 


Glu 


Lys 


Pro 


Tvr 
355 


Glu 


Cys 


Asn 


Gin 


Cys 
360 


\_j_L y 


Lys 


Ala 


Phe 


Ser 


His 


Pro 


Ser 


Ser 


Phe 


Arg 


Gly 


His 


Met 


Arg 








365 










370 










375 


Val 


His 


Thr 


Gly 


Glu 
380 


Lys 


Pro 


Tvr 


Glu 


Cys 
385 


Lys 


Gin 


Cys 


Gly 


Lys 
390 


X 111. 


rile 


Asn 


1 x p 


Pro 
395 


He 


Ser 


Leu 


Ara 


LVS 
400 


His 


Met 


Arg 


Thr 


His 
405 


X 11JL 


Arg 


Glu. 


Lys 


Pro 
410 


^ y ± - 


Glu 


Cvs 
— y 


Lvs 


Gin 
415 


Cys 


Gly 


Lys 


Ala 


Phe 
420 




Lsu 


Qot~ 
o t; i 


Ala 


L y b 

425 


Phe 


Arg 


Glu 


His 


Val 
430 


Arg 


Met 


His 


Pro 


Glu 
435 


Asp 


Lys 


Ser 


i y x 


Glu 
440 


Cys 


Lys 


Leu 


Cys 


Gly 
445 


Lys 


Ala 


Phe 


Tyr 


Cys 
450 


nlS 


Tip 

lie 


Ser 




Gin 
455 


Lys 


His 


Met 


Arcj 


Arcr 
460 


His 


Thr 


Ala 


Glu 


Lys 
465 


Leu 


Tyr 


Lys 




470 


Gin 


Cys 


Gly 


Lys 


Ala 
475 


Phe 


Ser 


Trp 


Pro 


Glu 
480 


Leu. 


L6U 


ulil 


Gin 


His 
4 8 5 


Val 


■"■X- y 


Thr 


His 


Thr 
4 90 


Val 


Glu 


Lys 


Pro 


Tyr 
495 


Glu 


Cys 


Lys 




Cys 
500 


uiy 


Lys 


Val 


Phe 


Lys 
505 




Pro 


Ser 


Ser 


Leu 
510 


Pro 


l±e 


His 


Me t 


Arg 
515 


Leu 


M-i O 
xl J. 


X I1X 


Gly 


Glu 
520 


Lys 


Pro 


Tvr 
x y ± - 


Gin 


Cys 
525 


Lys 


His 


Cys 


Gly 


Lys 
530 


Ala 


Phe 


Asn 


Cys 


Ser 
535 


Ser 


Ser 


Leu 


Arg 


Arg 
540 


His 


Val 


Arg 


He 


His 
545 


Thr 


Thr 


Glu 


Lys 


Gin 
550 


Tyr 


Lys 


Cys 


Asn 


Val 
555 


Gly 


His 


Pro 


Pro 


Ala 


Asn 


Glu 


Phe 


Met 


Cys 


Ser 


Ala 


Ser 


Glu 


Lys 








560 










565 










570 


Ser 


His 


Gin 


Glu 


Arg 
575 


Asp 


Leu 


lie 


Lys 


Val 
580 


Val 


Asn 


Met 


Val 


Leu 
585 



Pro Leu 



<210> 21 
<211> 346 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Clone No.: 2743828 



<400> 21 
























Met 


Ser 


Lys 


Pro 


Arg 


Ala 


Val 


Glu Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


1 








5 








10 










15 


Val 


Ala 


Ala 


Thr 


Ala 


Pro 


Gly 


Pro Glu 


Met 


Val 


Glu 


Arg 


Arg 


Gly 










20 








25 










30 


Pro 


Gly 


Arg 


Pro 


Arg 


Thr 


Asp 


Gly Glu 


Asn 


Val 


Phe 


Thr 


Gly 


Gin 










35 








40 










45 


Ser 


Lys 


He 


Tyr 


Ser 


Tyr 


Met 


Ser Pro 


Asn 


Lys 


Cys 


Ser 


Gly 


Met 










50 








55 










60 


Arg 


Phe 


Pro 


Leu 


Gin 


Glu 


Glu 


Asn Ser 


Val 


Thr 


His 


His 


Glu 


Val 
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65 70 75 

Lys Cys Gin Gly Lys Pro Leu Ala Gly He Tyr Arg Lys Arg Glu 

80 65 90 

Glu Lys Arg Asn Ala Gly Asn Ala Val Arg Ser Ala Met Lys Ser 

95 100 105 

Glu Glu Gin Lys He Lys Asp Ala Arg Lys Gly Pro Leu Val Pro 

110 115 120 

Phe Pro Asn Gin Lys Ser Glu Ala Ala Glu Pro Pro Lys Thr Pro 

125 130 135 

Pro Ser Ser Cys Asp Ser Thr Asn Ala Ala He Ala Lys Gin Ala 

140 145 150 

Leu Lys Lys Pro lie Lys Gly Lys Gin Ala Pro Arg Lys Lys Ala 

155 160 165 

Gin Gly Lys Thr Gin Gin Asn Arg Lys Leu Thr Asp Phe Tyr Pro 

170 175 180 

Val Arg Arg Ser Ser Arg Lys Ser Lys Ala Glu Leu Gin Ser Glu 

185 190 195 

Glu Arg Lys Arg He Asp Glu Leu He Glu Ser Gly Lys Glu Glu 

200 205 210 

Gly Met Lys He Asp Leu He Asp Gly Lys Gly Arg Gly Val He 

215 220 225 

Ala Thr Lys Gin Phe Ser Arg Gly Asp Phe Val Val Glu Tyr His 

230 235 240 

Gly Asp Leu He Glu He Thr Asp Ala Lys Lys Arg Glu Ala Leu 

245 250 255 

Tyr Ala Gin Asp Pro Ser Thr Gly Cys Tyr Met Tyr Tyr Phe Gin 

260 265 270 

Tyr Leu Ser Lys Thr Tyr Cys Val Asp Ala Thr Arg Glu Thr Asn 

275 280 285 

Arg Leu Gly Arg Leu He Asn His Ser Lys Cys Gly Asn Cys Gin 

290 295 300 

Thr Lys Leu His Asp He Asp Gly Val Pro His Leu He Leu He 

305 310 315 

Ala Ser Arg Asp He Ala Ala Gly Glu Glu Leu Leu Tyr Asp Tyr 

320 325 330 

Gly Asp Arg Ser Lys Ala Ser He Glu Ala His Pro Trp Leu Lys 

335 340 345 

His 



<210> 22 
<211> 481 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 2998209 

<400> 22 

Met Asp Phe Gin Arg He Glu Leu 

1 5 
Glu Glu Leu Glu Val Asp Phe Lys 
20 

Arg Met Lys Lys Thr Asp Trp Leu 



Ala Gly Ala Val Gly Ser Lys 
10 15 

Lys Leu Lys Gin He Lys Asn 
25 30 

Phe Leu Asn Ala Cys Val Gly 
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35 










40 










45 


Val 


Val 


Glu 


Glv 


Asp 
50 


Leu 


Ala 


Ala 


He 


Glu 
55 


Ala 


Tyr 


Lys 


Ser 


Ser 
60 


Glv 


Glv 


Asp 


He 


Ala 

65 


Arg 


Gin 


Leu 


Thr 


Ala 
70 


Asp 


Glu 


Val 


Arg 


Leu 
75 


Leu 


Asn 


Arg 


Pro 


Ser 
80 


Ala 


Phe 


Asp 


Val 


Gly 
85 


Tyr 


Thr 


Leu 


Val 


His 
90 


Leu 


Ala 


lie 


Arcr 


Phe 
95 


Gin 


Arg 


Gin 


Asp 


Met 
100 


Leu 


Ala 


He 


Leu 


Leu 
105 


Thr 


Glu 


Val 


Ser 


Gin 
110 


Gin 


Ala 


Ala 


Lys 


Cys 
115 


He 


Pro 


Ala 


Met 


Val 
120 


Cvs 


Piro 


Glu 


Leu 


Thr 
125 


Glu 


Gin 


lie 


Arg 


Arg 
130 


Glu 


He 


Ala 


Ala 


Ser 
135 


Leu. 


His 


Gin 


Arg 


Lvs 
j_ * i *-' 

140 


Glv 


Asp 


Phe 


Ala 


Cys 
145 


Tyr 


Phe 


Leu 


Thr 


Asp 
150 


Leu 


Val 


Thr 


Phe 


Thr 
155 


Leu 


Pro 


Ala 


Asp 


He 
160 


Glu 


Asp 


Leu 


Pro 


Pro 
165 


Thr 


Val 


Gin 


Glu 


Lys 
170 


Leu 


Phe 




Glu 


Val 
175 


Leu 


Asp 


Arg 


Asp 


Val 
180 


Gin 


Lys 


Glu 


Leu 


Glu 
185 


Glu 


Glu 


Ser 


Pro 


He 
190 


He 


Asn 


Trp 


Ser 


Leu 
195 


Glu 


Leu 


Ala 


Thr 


Arg 
200 


Leu 


Asp 


Ser 


Arg 


Leu 
205 


Tvr 


Ala 


Leu 


Trp 


Asn 
210 


Arg 


Thr 


Ala 


Gly 


Asp 
215 


Cvs 


Leu 


Leu 


Asp 


Ser 
220 


Val 


Leu 


Gin 


Ala 


Thr 
225 


r pT*Tl 


Gly 


He 


Tvr 
j 


Asp 
230 


Lys 


Asp 


Ser 


Val 


Leu 
235 


Arg 


Lys 


Ala 


Leu 


His 
240 




Ser 


Leu 


His 


Asp 
245 


Cvs 


Ser 


His 


Trp 


Phe 
250 


Tyr 


Thr 


Arg 


Trp 


Lys 
255 


Asp 




Glu 


Ser 


Tro 
260 


Tvr 


Ser 


Gin 


Ser 


Phe 
265 


Gly 


Leu 


His 


Phe 


Ser 
270 


Leu 


Arg 


Glu 


Glu 


Gin 
275 


Trp 


Gin 


Glu 


Asp 


Trp 
280 


Ala 


Phe 


He 


Leu 


Ser 
285 


Leu 


Ala 


Ser 


Gin 


Pro 
290 


Gly 


Ala 


Ser 


Leu 


Glu 

295 


Gin 


Thr 


His 


He 


Phe 

300 


Va.1 


Leu 


Ala 


His 


lie 
305 


Leu 


Arg 


Arg 


Pro 


He 
310 


He 


Val 


Tyr 


Gly 


Val 
315 


Lys 


Tvr 


Tvr 


Lys 


Ser 
320 


Phe 


Arg 


Gly 


Glu 


Thr 

325 


Leu 


Gly 


Tyr 


Thr 


Arg 
330 


Phe 


Gin 


Glv 


Val 


Tvr 
335 


Leu 


Pro 


Leu 


Leu 


Trp 
340 


Glu 


Gin 


Ser 


Phe 


Cys 
345 


Trr> 


Lys 


Ser 


Pro 


He 
350 


Ala 


Leu 


Gly 


Tyr 


Thr 
355 


Arg 


Gly 


His 


Phe 


Ser 
360 


Ala 


Leu 


Val 


Ala 


Met 
365 


Glu 


Asn 


Asp 


Gly 


Tyr 
370 


Gly 


Asn 


Arg 


Gly 


Ala 
375 


Gly 


Ala 


Asn 


Leu 


Asn 
380 


Thr 


Asp 


Asp 


Asp 


Val 
385 


Thr 


He 


Thr 


Phe 


Leu 
390 


Pro 


Leu 


Val 


Asp 


Ser 
395 


Glu 


Arg 


Lys 


Leu 


Leu 
400 


His 


Val 


His 


Phe 


Leu 
405 


Ser 


Ala 


Gin 


Glu 


Leu 
410 


Gly 


Asn 


Glu 


Glu 


Gin 
415 


Gin 


Glu 


Lys 


Leu 


Leu 
420 


Arg 


Glu 


Trp 


Leu 


Asp 
425 


Cys 


Cys 


Val 


Thr 


Glu 
430 


Gly 


Gly 


Val 


Leu 


val 
435 


Ala 


Met 


Gin 


Lys 


Ser 
440 


Ser 


Arg 


Arg 


Arg 


Asn 
445 


His 


Pro 


Leu 


Val 


Thr 
450 


Gin 


Met 


Val 


Glu 


Lys 
455 


Trp 


Leu 


Asp 


Arg 


Tyr 
460 


Arg 


Gin 


He 


Arg 


Pro 
465 
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Cys Thr Ser Leu Ser Asp Gly Glu Glu Asp Glu Asp Asp Glu Asp 
470 475 480 

Glu 



<210> 23 
<211> 179 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<223> Incyte Clone No.: 3340296 

<400> 23 



Met 


Ser 


Thr 


Gly 


Ser 


Leu 


Ser 


Asp 


Val 


Glu 


Asp 


Leu 


Gin 


Glu 


Val 


1 








5 










10 










15 


Glu 


Met 


Leu 


Glu 


Cys 
20 


Asp 


Gly 


Leu 


Lys 


Met 
25 


Asp 


Ser 


Asn 


Lys 


Glu 

30 


Phe 


Val 


Thr 


Ser 


Asn 
35 


Glu 


Ser 


Thr 


Glu 


Glu 
40 


Ser 


Ser 


Asn 


Cys 


Glu 
45 


Asn 


Gly 


Ser 


Pro 


Gin 


Lys 


Gly 


Arg 


Gly 


Gly 


Leu 


Gly 


Lys 


Arg 


Arg 








50 










55 










60 


Lys 


Ala 


Pro 


Thr 


Lys 


Lys 


Ser 


Pro 


Leu 


Ser 


Gly 


Val 


Ser 


Gin 


Glu 








65 










70 










75 


Gly 


Lys 


Gin 


Val 


Gin 


Arg 


Asn 


Ala 


Ala 


Asn 


Ala 


Arg 


Glu 


Arg 


Ala 






80 










85 










90 


Arg 


Met 


Arg 


Val 


Leu 


Ser 


Lys 


Ala 


Phe 


Ser 


Arg 


Leu 


Lys 


Thr 


Thr 






95 










100 










105 


Leu 


Pro 


Trp 


Val 


Pro 


Pro 


Asp 


Thr 


Lys 


Leu 


Ser 


Lys 


Leu 


Asp 


Thr 








110 










115 










120 


Leu 


Arg 


Leu 


Ala 


Ser 


Ser 


Tyr 


lie 


Ala 


His 


Leu 


Arg 


Gin 


lie 


Leu 








125 










130 










135 


Ala 


Asn 


Asp 


Lys 


Tyr 


Glu 


Asn 


Gly 


Tyr 


lie 


His 


Pro 


Val 


Asn 


Leu 








140 










145 










150 


Thr 


Trp 


Pro 


Phe 


Met 


Val 


Ala 


Gly 


Lys 


Pro 


Glu 


Ser 


Asp 


Leu 


Lys 








155 










160 










165 


Glu 


Val 


Val 


Thr 


Ala 
170 


Ser 


Arg 


Leu 


Cys 


Gly 
175 


Thr 


Thr 


Ala 


Ser 





c210> 24 

<211> 254 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 3536740 

<400> 24 

Met He Asp Glu He Leu Ser Lys Glu Thr Cys Asp Tyr Phe Glu 
15 10 15 
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Lys Leu Ser Leu Tyr Ser Val Cys Pro Ser Leu Val Val Arg Pro 

20 25 30 

Lys Pro Leu His Ser Cys Thr Gly Ser Pro Ser Leu Arg Ala Tyr 

35 40 45 

Pro Leu Leu Ser Val lie Thr Arg Gin Pro Thr Val lie Ser His 

50 55 60 

Leu Val Pro Ala Thr Pro Gly lie Ala Gin Ala Leu Ser Cys His 

65 70 75 

Gin Val Thr Glu Ala Val Ser Ala Glu Ala Pro Gly Gly Glu Ala 

80 85 90 

Leu Ala Ser Ser Glu Ser Glu Thr Glu Gin Pro Thr Pro Arg Gin 

95 100 105 

Lys Lys Pro Arg Arg Ser Arg Thr lie Phe Thr Glu Leu Gin Leu 

110 115 120 

Met Gly Leu Glu Lys Lys Phe Gin Lys Gin Lys Tyr Leu Ser Thr 

125 130 135 

Pro Asp Arg Leu Asp Leu Ala Gin Ser Leu Gly Leu Thr Gin Leu 

140 145 150 

Gin Val Lys Thr Trp Tyr Gin Asn Arg Arg Met Lys Trp Lys Lys 

155 160 165 

Met Val Leu Lys Gly Gly Gin Glu Ala Pro Thr Lys Pro Lys Gly 

170 175 180 

Arg Pro Lys Lys Asn Ser lie Pro Thr Ser Glu Glu lie Glu Ala 

185 190 195 

Glu Glu Lys Met Asn Ser Gin Ala Gin Gly Gin Glu Gin Leu Glu 

200 205 210 

Pro Ser Gin Gly Gin Glu Glu Leu Cys Glu Ala Gin Glu Pro Lys 

215 220 225 

Ala Arg Asp Val Pro Leu Glu Met Ala Glu Pro Pro Asp Pro Pro 

230 235 240 

Gin Glu Leu Pro lie Pro Ser Ser Glu Pro Pro Pro Leu Ser 

245 250 



<210> 25 
<211> 498 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Clone No.: 082155 

<400> 25 

Met Asp Phe Ser Val Lys Val Asp lie Glu Lys Glu Val Thr Cys 

15 10 15 

Pro lie Cys Leu Glu Leu Leu Thr Glu Pro Leu Ser Leu Asp Cys 

20 25 30 

Gly His Ser Phe Cys Gin Ala Cys lie Thr Ala Lys lie Lys Glu 

35 40 45 

Ser Val lie He Ser Arg Gly Glu Ser Ser Cys Pro Val Cys Gin 

50 55 60 

Thr Arg Phe Gin Pro Gly Asn Leu Arg Pro Asn Arg His Leu Ala 

65 70 75 

Asn lie Val Glu Arg Val Lys Glu Val Lys Met Ser Pro Gin Glu 
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80 85 90 

Gly Gin Lys Arg Asp Val Cys Glu His His Gly Lys Lys Leu Gin 

95 100 105 

lie Phe Cys Lys Glu Asp Gly Lys Val lie Cys Trp Val Cys Glu 

110 115 120 

Leu Ser Gin Glu His Gin Gly His Gin Thr Phe Arg lie Asn Glu 

125 130 135 

Val Val Lys Glu Cys Gin Glu Lys Leu Gin Val Ala Leu Gin Arg 

140 145 150 

Leu lie Lys Glu Asp Gin Glu Ala Glu Lys Leu Glu Asp Asp lie 

155 160 165 

Arg Gin Glu Arg Thr Ala Trp Lys Asn Tyr lie Gin lie Glu Arg 

170 175 180 

Gin Lys lie Leu Lys Gly Phe Asn Glu Met Arg Val lie Leu Asp 

185 190 195 

Asn Glu Glu Gin Arg Glu Leu Gin Lys Leu Glu Glu Gly Glu Val 

200 205 210 

Asn Val Leu Asp Asn Leu Ala Ala Ala Thr Asp Gin Leu Val Gin 

215 220 225 

Gin Arg Gin Asp Ala Ser Thr Leu lie Ser Asp Leu Gin Arg Arg 

230 235 240 

Leu Thr Gly Ser Ser Val Glu Met Leu Gin Asp Val lie Asp Val 

245 250 255 

Met Lys Arg Ser Glu Ser Trp Thr Leu Lys Lys Pro Lys Ser Val 

260 265 270 

Ser Lys Lys Leu Lys Ser Val Phe Arg Val Pro Asp Leu Ser Gly 

275 280 285 

Met Leu Gin Val Leu Lys Glu Leu Thr Asp Val Gin Tyr Tyr Trp 

290 295 300 

Val Asp Val Met Leu Asn Pro Gly Ser Ala Thr Ser Asn Val Ala 

305 310 315 

lie Ser Val Asp Gin Arg Gin Val Lys Thr Val Arg Thr Cys Thr 

320 325 330 

Phe Lys Asn Ser Asn Pro Cys Asp Phe Ser Ala Phe Gly Val Phe 

335 340 345 

Gly Cys Gin Tyr Phe Ser Ser Gly Lys Tyr Tyr Trp Glu Val Asp 

350 355 360 

Val Ser Gly Lys lie Ala Trp lie Leu Gly Val His Ser Lys lie 

365 370 375 

Ser Ser Leu Asn Lys Arg Lys Ser Ser Gly Phe Ala Phe Asp Pro 

380 385 390 

Ser Val Asn Tyr Ser Lys Val Tyr Ser Arg Tyr Arg Pro Gin Tyr 

395 400 405 

Gly Tyr Trp Val lie Gly Leu Gin Asn Thr Cys Glu Tyr Asn Ala 

410 415 420 

Phe Glu Asp Ser Ser Ser Ser Asp Pro Lys Val Leu Thr Leu Phe 

425 430 435 

Met Ala Val Pro Pro Cys Arg lie Gly Val Phe Leu Asp Tyr Glu 

440 445 450 

Ala Gly lie Val Ser Phe Phe Asn Val Thr Asn His Gly Ala Leu 

455 460 465 

lie Tyr Lys Phe Ser Gly Cys Arg Phe Ser Arg Pro Ala Tyr Pro 

470 475 480 

Tyr Phe Asn Pro Trp Asn Cys Leu Val Pro Met Thr Val Cys Pro 

485 490 495 

Pro Ser Ser 
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<210> 26 
<211> 1299 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 095477 



<400> 26 
Met Ala Ala 
1 

Arg Ala Leu 
Pro Ala Leu 
Glu Glu Met 
Asp Leu Leu 
Leu Pro Pro 
Asn Phe Ser 
Gly Arg Gly 
Ser Ser Ser 
Tyr Gin Arg 
Gly Gly Arg 
Asp Arg Arg 
Asn Phe Glu 
Phe lie Arg 
Gin Asn Gly 
Arg Arg Asp 
Arg Ser Ala 
Glu Phe Asp 
Val Arg Ser 
Pro Glu Trp 
Asp Ser Ser 
Pro lie Pro 
Gly Glu Glu 



Glu Thr Gin 

5 

Ser Ser Gly 
20 

Pro Lys Tyr 
35 

Leu Ala Leu 
50 

Asp Lys Glu 
65 

Leu Ala Leu 
80 

Met Ser Val 
95 

Gly Gly Gly 
110 

Arg Gly Arg 

125 
Ser Phe Asp 

140 
Glu Met His 

155 
Phe Glu Lys 

170 
Glu Gly Gly 

185 

Ser Glu Ser 

200 
Glu Asp Glu 

215 
Gly Glu Arg 

230 
Gly Trp Arg 

245 
Phe Arg Asp 

260 
Gly Ser Gly 

275 
Cys Leu Glu 

290 
Gly Ala Phe 

305 
Glu Glu Gin 

320 
Cys Ser Asp 



Thr Leu Asn 
Gly Ser lie 
Lys Leu Ala 
Phe Leu Lys 
Phe Leu Pro 
Val Pro Phe 
Asn Ser Ala 
Thr Val Val 
Gly Arg Gly 
Glu Val Glu 
Arg Ser Gin 
Pro Gly Arg 
Pro Thr Ser 
Glu Asn Trp 
Asp Gly Gly 
Trp Arg Pro 
Glu His Met 
Arg Asp Asp 
Ser lie Asp 
Asp Ala Glu 
Leu Ser Leu 
Glu Met Asp 
Ser Glu Gly 



Phe Gly Pro 
10 

Thr Ser Pro 
25 

Asp Tyr Arg 
40 

Asp Asn Lys 
55 

lie Leu Gin 
70 

Thr Glu Glu 
85 

Ala Val Leu 
100 

Gly Ala Pro 
115 

Arg Gly Glu 
130 

Gly Val Phe 
145 

Ser Trp Glu 
160 

Lys Asp Val 
175 

Val Gly Arg 
190 

Arg lie Phe 
205 

Trp Arg Leu 
220 

His Ser Pro 
235 

Glu Arg Arg 
250 

Glu Arg Gly 
265 

Asp Asp Arg 
280 

Glu Glu Met 
295 

Lys Lys Val 
310 

Phe Arg Pro 
325 

Ser His Asn 



Glu Trp Leu 
15 

Pro Leu Ser 
30 

Tyr Gly Arg 
45 

lie Pro Ser 
60 

Glu Glu Pro 
75 

Glu Gin Arg 
90 

Arg Leu Thr 
105 

Arg Gly Arg 

120 

Cys Gly Phe 
135 

Gly Arg Gly 
150 

Glu Arg Gly 
165 

Gly Arg Pro 
180 

Lys His Glu 
195 

Arg Glu Glu 
210 

Ala Gly Ser 
225 

Asp Gly Pro 
240 

Arg Arg Phe 
255 

Tyr Arg Arg 
270 

Asp Ser Leu 
285 

Gly Thr Phe 
300 

Gin Lys Glu 
315 

Val Asp Glu 
330 

Glu Glu Ala 
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Lys Glu Pro 
Arg Val Gly 
Ser Ser Ala 
Ala Ser Gin 
Glu Lys Ala 
Lys Val Pro 
Pro Leu Ser 
Leu Pro Pro 
Glu Thr Pro 
Glu Pro Asp 
Glu Lys Met 
Arg Leu Ala 
lie Pro Leu 
Pro Gin Gly 
Glu Trp Phe 
Arg Ala Cys 
Met Trp Gly 
His Met Gly 
Leu Thr Ala 
Leu lie Gin 
Ala Ala Leu 
Gin Gin Phe 
lie Pro Ser 
lie Trp Glu 
Gly Gly Ser 
Pro Ala Leu 
Lys Leu Glu 
Glu Glu Glu 



335 
Asp Lys Thr 

350 
Val Glu Ala 

365 
Arg Pro Gly 

380 
Phe Glu Arg 

395 
Glu Glu Glu 

410 
Ser Arg Gly 

425 
Gin lie Pro 

440 
Pro Val Pro 

455 
Val Val Gly 

470 
Asp Glu Glu 

485 
Val Ala Tyr 

500 
Ser Lys Leu 

515 

Met His Glu 

530 
Glu lie Gin 

545 
Gin Ala Gly 

560 
Asp Glu Ser 

575 
Arg Val Pro 

590 
Glu Leu Asp 

605 
Leu Tyr Gin 

620 
Gin Gin Tyr 

635 
Ser Ser Gin 

650 
Gin Thr Leu 

665 
Val Thr Arg 

680 
Leu Gin Pro 

695 
Val Trp Asp 

710 
Glu Gin Leu 

725 
Gin Glu Arg 

740 
Glu Arg Lys 

755 



Asn Lys Lys 
Ser Glu Glu 
Thr Pro Ser 
Lys Asp Glu 
Thr Arg Met 
Asp Glu Met 
Ser Asp Thr 
Asn Pro Ser 
Ala Pro Gly 
Gly Leu Lys 
Leu Gin Asp 
Gin Glu His 
Ala Met Gin 
Gly Pro Phe 
Tyr Phe Thr 
Phe Gin Pro 
Phe Ser Pro 
Gin Glu Arg 
Met Gin His 
Ala Gin Val 
Gin Gin Gin 
Lys Met Arg 
Ser Val Ser 
Thr Ala Ser 
Leu Pro Leu 
Gin Gin Leu 
Arg Glu Ala 
Arg Gin Glu 



340 

Glu Gly Glu 
355 

Thr Pro Gin 
370 

Asp His Gin 
385 

Pro Lys Thr 
400 

Glu Asn Ser 
415 

Val Ala Asp 
430 

Ala Ser Pro 
445 

Pro Thr Leu 
460 

Met Gly Ser 
475 

His Leu Glu 
490 

Ser Ala Leu 
505 

Arg Ala Lys 
520 

Lys Trp Tyr 
535 

Asn Asn Gin 
550 

Met Ser Leu 
565 

Leu Gly Asp 
580 

Gly Pro Ala 
595 

Leu Thr Arg 
610 

Leu Gin Tyr 
625 

Leu Ala Gin 
640 

Gin Leu Ala 
655 

lie Ser Asp 
670 

Val Pro Asp 
685 

Gin Pro Thr 
700 

Asp Thr Thr 
715 

Glu Lys Ala 
730 

Glu Met Arg 
745 

Glu Leu Arg 
760 



345 

Lys Thr Asp 
360 

Thr Ser Ser 
375 

Ser Gin Glu 
390 

Glu Gin Thr 
405 

Leu Pro Ala 
420 

Val Gin Gin 
435 

Leu Leu lie 
450 

Arg Pro Val 
465 

Val Ser Thr 
480 

Gin Gin Ala 
495 

Asp Asp Glu 
510 

Gly Val Ser 
525 

Tyr Lys Asp 
540 

Glu Met Ala 
555 

Leu Val Lys 
570 

lie Met Lys 
585 

Pro Pro Pro 
600 

Gin Gin Glu 
615 

Gin Gin Phe 
630 

Gin Gin Lys 
645 

Leu Leu Leu 
660 

Gin Asn lie 
675 

Thr Gly Ser 
690 

Val Trp Glu 
705 

Thr Pro Gly 
720 

Lys Ala Ala 
735 

Ala Lys Arg 
750 

Arg Gin Gin 
765 
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Glu Glu lie Leu Arg Arg Gin Gin Glu Glu Glu Arg Lys Arg Arg 

770 775 780 

Glu Glu Glu Glu Leu Ala Arg Arg Lys Gin Glu Glu Ala Leu Arg 

785 790 795 

Arg Gin Arg Glu Gin Glu lie Ala Leu Arg Arg Gin Arg Glu Glu 

800 805 810 

Glu Glu Arg Gin Gin Gin Glu Glu Ala Leu Arg Arg Leu Glu Glu 

815 820 825 

Arg Arg Arg Glu Glu Glu Glu Arg Arg Lys Gin Glu Glu Leu Leu 

830 835 840 

Arg Lys Gin Glu Glu Glu Ala Ala Lys Trp Ala Arg Glu Glu Glu 

845 850 855 

Glu Ala - Gin Arg Arg Leu Glu Glu Asn Arg Leu Arg Met Glu Glu 

860 865 870 

Glu Ala Ala Arg Leu Arg His Glu Glu Glu Glu Arg Lys Arg Lys 

875 880 885 

Glu Leu Glu Val Gin Arg Gin Lys Glu Leu Met Arg Gin Arg Gin 

890 895 900 

Gin Gin Gin Glu Ala Leu Arg Arg Leu Gin Gin Gin Gin Gin Gin 

905 910 915 

Gin Gin Leu Ala Gin Met Lys Leu Pro Ser Ser Ser Thr Trp Gly 

920 925 930 

Gin Gin Ser Asn Thr Thr Ala Cys Gin Ser Gin Ala Thr Leu Ser 

935 940 945 

Leu Ala Glu lie Gin Lys Leu Glu Glu Glu Arg Glu Arg Gin Leu 

950 955 960 

Arg Glu Glu Gin Arg Arg Gin Gin Arg Glu Leu Met Lys Ala Leu 

965 970 975 

Gin Gin Gin Gin Gin Gin Gin Gin Gin Lys Leu Ser Gly Trp Gly 

980 985 990 

Asn Val Ser Lys Pro Ser Gly Thr Thr Lys Ser Leu Leu Glu lie 

995 1000 1005 

Gin Gin Glu Glu Ala Arg Gin Met Gin Lys Gin Gin Gin Gin Gin 

1010 1015 1020 

Gin Gin His Gin Gin Pro Asn Arg Ala Arg Asn Asn Thr His Ser 

1025 1030 1035 

Asn Leu His Thr Ser lie Gly Asn Ser Val Trp Gly Ser lie Asn 

1040 1045 1050 

Thr Gly Pro Pro Asn Gin Trp Ala Ser Asp Leu Val Ser Ser lie 

1055 1060 1065 

Trp Ser Asn Ala Asp Thr Lys Asn Ser Asn Met Gly Phe Trp Asp 

1070 1075 1080 

Asp Ala Val Lys Glu Val Gly Pro Arg Asn Ser Thr Asn Lys Asn 

1085 1090 1095 

Lys Asn Asn Ala Ser Leu Ser Lys Ser Val Gly Val Ser Asn Arg 

1100 1105 1110 

Gin Asn Lys Lys Val Glu Glu Glu Glu Lys Leu Leu Lys Leu Phe 

1115 1120 1125 

Gin Gly Val Asn Lys Ala Gin Asp Gly Phe Thr Gin Trp Cys Glu 

1130 1135 1140 

Gin Met Leu His Ala Leu Asn Thr Ala Asn Asn Leu Asp Val Pro 

1145 1150 1155 

Thr Phe Val Ser Phe Leu Lys Glu Val Glu Ser Pro Tyr Glu Val 

1160 1165 1170 

His Asp Tyr lie Arg Ala Tyr Leu Gly Asp Thr Ser Glu Ala Lys 

1175 1180 1185 

Glu Phe Ala Lys Gin Phe Leu Glu Arg Arg Ala Lys Gin Lys Ala 
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1190 1195 1200 

Asn Gin Gin Arg Gin Gin Gin Gin Leu Pro Gin Gin Gin Gin Gin 

1205 1210 1215 

Gin Pro Pro Gin Gin Pro Pro Gin Gin Pro Gin Gin Gin Asp Ser 

1220 1225 1230 

Val Trp Gly Met Asn His Ser Thr Leu His Ser Val Phe Gin Thr 

1235 1240 1245 

Asn Gin Ser Asn Asn Gin Gin Ser Asn Phe Glu Ala Val Gin Ser 

1250 1255 1260 

Gly Lys Lys Lys Lys Lys Gin Lys Met Val Arg Ala Asp Pro Ser 

1265 1270 1275 

Leu Leu Gly Phe Ser Val Asn Ala Ser Ser Glu Arg Leu Asn Met 

1280 1285 1290 

Gly Glu lie Glu Thr Leu Asp Asp Tyr 

1295 



<210> 27 

<211> 951 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Clone No. : 1399169 

<400> 27 

Met Ala Thr Gly Thr Gly Lys His Lys Leu Leu Ser Thr Gly Pro 

15 10 15 

Thr Glu Pro Trp Ser lie Arg Glu Lys Leu Cys Leu Ala Ser Ser 

20 25 30 

Val Met Arg Ser Gly Asp Gin Asn Trp Val Ser Val Ser Arg Ala 

35 40 45 

lie Lys Pro Phe Ala Glu Pro Gly Arg Pro Pro Asp Trp Phe Ser 

50 55 60 

Gin Lys His Cys Ala Ser Gin Tyr Ser Glu Leu Leu Glu Thr Thr 

65 70 75 

Glu Thr Pro Lys Arg Lys Arg Gly Glu Lys Gly Glu Val Val Glu 

80 85 90 

Thr Val Glu Asp Val lie Val Arg Lys Leu Thr Ala Glu Arg Val 

95 100 105 

Glu Glu Leu Lys Lys Val lie Lys Glu Thr Gin Glu Arg Tyr Arg 

110 115 120 

Arg Leu Lys Arg Asp Ala Glu Leu lie Gin Ala Gly His Met Asp 

125 130 135 

Ser Arg Leu Asp Glu Leu Cys Asn Asp lie Ala Thr Lys Lys Lys 

140 145 150 

Leu Glu Glu Glu Glu Ala Glu Val Lys Arg Lys Ala Thr Asp Ala 

155 160 165 

Ala Tyr Gin Ala Arg Gin Ala Val Lys Thr Pro Pro Arg Arg Leu 

170 175 180 

Pro Thr Val Met Val Arg Ser Pro lie Asp Ser Ala Ser Pro Gly 

185 190 195 

Gly Asp Tyr Pro Leu Gly Asp Leu Thr Pro Thr Thr Met Glu Glu 

200 205 210 
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Ala Thr Ser Gly Val Thr Pro Gly Thr Leu Pro Ser Thr Pro Val 

215 220 225 
Thr Ser Phe Pro Gly lie Pro Asp Thr Leu Pro Pro Gly Ser Ala 

230 235 240 

Pro Leu Glu Ala Pro Met Thr Pro Val Thr Asp Asp Ser Pro Gin 

245 250 255 
Lys Lys Met Leu Gly Gin Lys Ala Thr Pro Pro Pro Ser Pro Leu 

260 265 270 
Leu Ser Glu Leu Leu Lys Lys Gly Ser Leu Leu Pro Thr Ser Pro 

275 280 285 

Arg Leu Val Asn Glu Ser Glu Met Ala Val Ala Ser Gly His Leu 

290 295 300 

Asn Ser Thr Gly Val Leu Leu Glu Val Gly Gly Val Leu Pro Met 

305 310 315 

He His Gly Gly Glu He Gin Gin Thr Pro Asn Thr Val Ala Ala 

320 325 330 

Ser Pro Ala Ala Ser Gly Ala Pro Thr Leu Ser Arg Leu Leu Glu 

335 340 345 

Ala Gly Pro Thr Gin Phe Thr Thr Pro Leu Ala Ser Phe Thr Thr 

350 355 360 

Val Ala Ser Glu Pro Pro Val Lys Leu Val Pro Pro Pro Val Glu 

365 370 375 

Ser Val Ser Gin Ala Thr He Val Met Met Pro Ala Leu Pro Ala 

380 385 390 

Pro Ser Ser Ala Pro Ala Val Ser Thr Thr Glu Ser Val Ala Pro 

395 400 405 

Val Ser Gin Pro Asp Asn Cys Val Pro Met Glu Ala Val Gly Asp 

410 415 420 

Pro His Thr Val Thr Val Ser Met Asp Ser Ser Glu lie Ser Met 

425 430 435 

lie lie Asn Ser He Lys Glu Glu Cys Phe Arg Ser Gly Val Ala 

440 445 450 

Glu Ala Pro Val Gly Ser Lys Ala Pro Ser lie Asp Gly Lys Glu 

455 460 465 

Glu Leu Asp Leu Ala Glu Lys Met Asp He Ala Val Ser Tyr Thr 

470 475 480 

Gly Glu Glu Leu Asp Phe Glu Thr Val Gly Asp He He Ala lie 

485 490 495 

He Glu Asp Lys Val Asp Asp His Pro Glu Val Leu Asp Val Ala 

500 505 510 

Ala Val Glu Ala Ala Leu Ser Phe Cys Glu Glu Asn Asp Asp Pro 

515 520 525 

Gin Ser Leu Pro Gly Pro Trp Glu His Pro He Gin Gin Glu Arg 

530 535 540 

Asp Lys Pro Val Pro Leu Pro Ala Pro Glu Met Thr Val Lys Gin 

545 550 555 

Glu Arg Leu Asp Phe Glu Glu Thr Glu Asn Lys Gly He His Glu 

560 565 570 

Leu Val Asp He Arg Glu Pro Ser Ala Glu He Lys Val Glu Pro 

575 580 585 

Ala Glu Pro Glu Pro Val He Ser Gly Ala Glu He Val Ala Gly 

590 595 600 

Val Val Pro Ala Thr Ser Met Glu Pro Pro Glu Leu Arg Ser Gin 

605 610 615 

Asp Leu Asp Glu Glu Leu Gly Ser Thr Ala Ala Gly Glu He Val 

620 625 630 

Glu Ala Asp Val Ala He Gly Lys Gly Asp Glu Thr Pro Leu Thr 
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635 640 645 

Asn Val Lys Thr Glu Ala Ser Pro Glu Ser Met Leu Ser Pro Ser 

650 655 660 

His Gly Ser Asn Pro He Glu Asp Pro Leu Glu Ala Glu Thr Gin 

665 670 675 

His Lys Phe Glu Met Ser Asp Ser Leu Lys Glu Glu Ser Gly Thr 

680 685 690 

He Phe Gly Ser Gin lie Lys Asp Ala Pro Gly Glu Asp Glu Glu 

695 700 705 

Glu Asp Gly Val Ser Glu Ala Ala Ser Leu Glu Glu Pro Lys Glu 

710 715 720 

Glu Asp Gin Gly Glu Gly Tyr Leu Ser Glu Met Asp Asn Glu Pro 

725 730 735 

Pro Val Ser Glu Ser Asp Asp Gly Phe Ser He His Asn Ala Thr 

740 745 750 

Leu Gin Ser His Thr Leu Ala Asp Ser lie Pro Ser Ser Pro Ala 

755 760 765 

Ser Ser Gin Phe Ser Val Cys Ser Glu Asp Gin Glu Ala lie Gin 

770 775 780 

Ala Gin Lys lie Trp Lys Lys Ala He Met Leu Val Trp Arg Ala 

785 790 795 

Ala Ala Asn His Arg Tyr Ala Asn Val Phe Leu Gin Pro Val Thr 

800 805 810 

Asp Asp He Ala Pro Gly Tyr His Ser He Val Gin Arg Pro Met 

815 820 825 

Asp Leu Ser Thr He Lys Lys Asn He Glu Asn Gly Leu He Arg 

830 835 840 

Ser Thr Ala Glu Phe Gin Arg Asp He Met Leu Met Phe Gin Asn 

845 850 855 

Ala Val Met Tyr Asn Ser Ser Asp His Asp Val Tyr His Met Ala 

860 865 870 

Val Glu Met Gin Arg Asp Val Leu Glu Gin He Gin Gin Phe Leu 

875 880 885 

Ala Thr Gin Leu He Met Gin Thr Ser Glu Ser Gly He Ser Ala 

890 895 900 

Lys Ser Leu Arg Gly Arg Asp Ser Thr Arg Lys Gin Asp Ala Ser 

905 910 915 

Glu Lys Asp Ser Val Pro Met Gly Ser Pro Ala Phe Leu Leu Ser 

920 925 930 

Leu Phe Asp Gly Gly Thr Arg Gly Arg Arg Cys Ala He Glu Ala 

935 940 945 

Asp Met Lys Met Lys Lys 

950 



<210> 28 
<211> 282 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Clone No.: 1442069 

<400> 28 
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Met 


Pro 


Lys 


Arg 


Lys 


Ala 


Ala 


Glv 


Gin 


Glv 


Asp 


Met 


Arcr 


Gin 


Glu 


1 








5 










10 










15 


Pro 


Lys 


Arrr 


Arcr 


Ser 
20 


Ala 


Arg 


Leu 


Ser 


Ala 
25 


Met 


Leu 


Val 


Pro 


Val 
30 


Thr 


Pro 


Glu 


Val 


Lys 
35 


Piro 


Lys 


Arcr 


Thr 


Ser 
40 


Ser 


Ser 


Arg 


Lys 


Met 

45 


Lys 


Thr" 


Lys 


Ser 


Asp 
50 


Met 


Met 


Glu 


Glu 


Asn 
55 


lie 


Asp 


Thr 


Ser 


Ala 
60 


Gin 


Ala 


Val 


Ala 


Glu 
65 


Thr 


Lys 


Gin 


Glu 


Ala 
70 


Val 


Val 


Glu 


Glu 


Asp 
75 


Tvr 


Asn 


Glu 


Asn 


Ala 
80 


Lys 


Asn 


Glv 


Glu 


Ala 
85 


Lvs 


He 


Thr 


Glu 


Ala 
90 


Pro 


Ala 


Ser 


Glu 


Lys 
95 


Glu 


lie 


Val 


Glu 


Val 
100 


Lys 


Glu 


Glu 


Asn 


lie 
105 


Glu 


Asp 


Ala 


Till* 


Glu 
110 


Lys 


Glv 


Glv 


Glu 


Lys 
115 


Lys 


Glu 


Ala 


Val 


Ala 
120 


Ala 


Glu 


Val 


Lys 


Asn 
125 


Glu 


Glu 


Glu 


Asp 


Gin 
130 


Lys 


Glu 


Asp 


Glu 


Glu 

135 


AsiD 


Gin 


Asn 


Glu 


Glu 
140 


Lys 


Gly 


Glu 


Ala 


Glv 
145 


Lys 


Glu 


Asp 


Lys 


Asp 
150 


Glu 


Lvs 


Gly 


Glu 


Glu 
155 


Asp 


Gly 


Lys 


Glu 


Asp 
160 


Lys 


Asn 


Glv 


Asn 


Glu 
165 




Gly 


Glu 


Asp 


Ala 
170 


Lys 


Glu 


Lys 


Glu 


Asp 
175 


Gly 


Lys 


Lys 


Glv 


Glu 
180 




Gly 




Gly 


Asn 
185 


Gly 


Glu 


Asp 


Glv 

vjx y 


Lys 
190 


Glu 


Lys 


Glv 


Glu 


Asp 
195 


Glu 


Lys 


Glu 


Glu 


Glu 
200 


Asp 


Arg 


Lys 


Glu 


Thr 
205 


Glv 


Asp 


Glv 


Lys 


Glu 
210 


Asn 


blU 


Asp 


C~\ \s 

uiy 


215 


Glu 




Gly 




220 




Glu 


Gly 


Lys 


Asp 
225 


Val 


Lys 


Val 


Lys 


Glu 
230 


Asp 


Glu 


Lys 


Glu 


Arg 
235 


Glu 


Asp 


Gly 


Lys 


Glu 
240 


Asp 


Glu 


Gly 


Gly 


Asn 
245 


Glu 


Glu 


Glu 


Ala 


Gly 
250 


Lys 


Glu 


Lys 


Glu 


Asp 
255 


Leu 


Lys 


Glu 


Glu 


Glu 
260 


Glu 


Gly 


Lys 


Glu 


Glu 
265 


Asp 


Glu 


He 


Lys 


Glu 

270 


Asp 


Asp 


Gly 


Lys 


Lys 
275 


Glu 


Glu 


Pro 


Gin 


Ser 
280 


He 


Val 









<210> 29 
<211> 186 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 1596668 

<400> 29 

Met Asp Ala Asp Ser Asp Val Ala 

1 5 
Val Val Cys Val Phe Arg Thr Arg 
20 

lie Ala Leu Glu Gly Ala Asn Val 



Leu Asp lie Leu lie Thr Asn 

10 15 

Cys His Leu Asn Leu Arg Lys 

25 30 

lie Tyr Lys Arg Asp Val Gly 
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35 40 45 

Lys Val Leu Met Lys Leu Arg Lys Pro Arg lie Thr Ala Thr lie 

50 55 60 

Trp Ser Ser Gly Lys lie lie Cys Thr Gly Ala Thr Ser Glu Glu 

65 70 75 

Glu Ala Lys Phe Gly Ala Arg Arg Leu Ala Arg Ser Leu Gin Lys 

80 85 90 

Leu Gly Phe Gin Val lie Phe Thr Asp Phe Lys Val Val Asn Val 

95 100 105 

Leu Ala Val Cys Asn Met Pro Phe Glu lie Arg Leu Pro Glu Phe 

110 115 120 

Thr Lys Asn Asn Arg Pro His Ala Ser Tyr Glu Pro Glu Leu His 

125 130 135 

Pro Ala Val Cys Tyr Arg lie Lys Ser Leu Arg Ala Thr Leu Gin 

140 145 150 

lie Phe Ser Thr Gly Ser lie Thr Val Thr Gly Pro Asn Val Lys 

155 160 165 

Ala Val Ala Thr Ala Val Glu Gin lie Tyr Pro Phe Val Phe Glu 

170 175 180 
Ser Arg Lys Glu lie Leu 
185 



<210> 30 

<211> 917 

<212> PRT 

<213> Homo sapiens 

<220> 

<22l> misc_feature 

<223> Incyte Clone No. : 1977214 

<400> 30 

Met Ala Glu Thr Leu Ser Gly Leu Gly Asp Ser Gly Ala Ala Gly 

15 10 15 

Ala Ala Ala Leu Ser Ser Ala Ser Ser Glu Thr Gly Thr Arg Arg 

20 25 30 

Leu Ser Asp Leu Arg Val lie Asp Leu Arg Ala Glu Leu Arg Lys 

35 40 45 

Arg Asn Val Asp Ser Ser Gly Asn Lys Ser Val Leu Met Glu Arg 

50 55 60 

Leu Lys Lys Ala lie Glu Asp Glu Gly Gly Asn Pro Asp Glu lie 

65 70 75 

Glu lie Thr Ser Glu Gly Asn Lys Lys Thr Ser Lys Arg Ser Ser 

80 85 90 

Lys Gly Arg Lys Pro Glu Glu Glu Gly Val Glu Asp Asn Gly Leu 

95 100 105 

Glu Glu Asn Ser Gly Asp Gly Gin Glu Asp Val Glu Thr Ser Leu 

110 115 120 

Glu Asn Leu Gin Asp lie Asp lie Met Asp lie Ser Val Leu Asp 

125 130 135 

Glu Ala Glu lie Asp Asn Gly Ser Val Ala Asp Cys Val Glu Asp 

140 145 150 

Asp Asp Ala Asp Asn Leu Gin Glu Ser Leu Ser Asp Ser Arg Glu 

155 160 165 
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Leu Val Glu Gly Glu Met Lys Glu Leu Pro Glu Gin Leu Gin Glu 

170 175 180 

His Ala lie Glu Asp Lys Glu Thr lie Asn Asn Leu Asp Thr Ser 

185 190 195 

Ser Ser Asp Phe Thr lie Leu Gin Glu lie Glu Glu Pro Ser Leu 

200 205 210 

Glu Pro Glu Asn Glu Lys lie Leu Asp lie Leu Gly Glu Thr Cys 

215 220 225 

Lys Ser Glu Pro Val Lys Glu Glu Ser Ser Glu Leu Glu Gin Pro 

230 235 240 

Phe Ala Gin Asp Thr Ser Ser Val Gly Pro Asp Arg Lys Leu Ala 

245 250 255 

Glu Glu Glu Asp Leu Phe Asp Ser Ala His Pro Glu Glu Gly Asp 

260 265 270 

Leu Asp Leu Ala Ser Glu Ser Thr Ala His Ala Gin Ser Ser Lys 

275 280 285 

Ala Asp Ser Leu Leu Ala Val Val Lys Arg Glu Pro Ala Glu Gin 

290 295 300 

Pro Gly Asp Gly Glu Arg Thr Asp Cys Glu Pro Val Gly Leu Glu 

305 310 315 

Pro Ala Val Glu Gin Ser Ser Ala Ala Ser Glu Leu Ala Glu Ala 

320 325 330 

Ser Ser Glu Glu Leu Ala Glu Ala Pro Thr Glu Ala Pro Ser Pro 

335 340 345 

Glu Ala Arg Asp Ser Lys Glu Asp Gly Arg Lys Phe Asp Phe Asp 

350 355 360 

Ala Cys Asn Glu Val Pro Pro Ala Pro Lys Glu Ser Ser Thr Ser 

365 370 375 

Glu Gly Ala Asp Gin Lys Met Ser Ser Pro Glu Asp Asp Ser Asp 

380 385 390 

Thr Lys Arg Leu Ser Lys Glu Glu Lys Gly Arg Ser Ser Cys Gly 

395 400 405 

Arg Asn Phe Trp Val Ser Gly Leu Ser Ser Thr Thr Arg Ala Thr 

410 415 420 

Asp Leu Lys Asn Leu Phe Ser Lys Tyr Gly Lys Val Val Gly Ala 

425 430 435 

Lys Val Val Thr Asn Ala Arg Ser Pro Gly Ala Arg Cys Tyr Gly 

440 445 450 

Phe Val Thr Met Ser Thr Ala Glu Glu Ala Thr Lys Cys lie Asn 

455 460 465 

His Leu His Lys Thr Glu Leu His Gly Lys Met lie Ser Val Glu 

470 475 480 

Lys Ala Lys Asn Glu Pro Val Gly Lys Lys Thr Ser Asp Lys Arg 

485 490 495 

Asp Ser Asp Gly Lys Lys Glu Lys Ser Ser Asn Ser Asp Arg Ser 

500 505 510 

Thr Asn Leu Lys Arg Asp Asp Lys Cys Asp Arg Lys Asp Asp Ala 

515 520 525 

Lys Lys Gly Asp Asp Gly Ser Gly Glu Lys Ser Lys Asp Gin Asp 

530 535 540 

Asp Gin Lys Pro Gly Pro Ser Glu Arg Ser Arg Ala Thr Lys Ser 

545 550 555 

Gly Ser Arg Gly Thr Glu Arg Thr Val Val Met Asp Lys Ser Lys 

560 565 570 

Gly Val Pro Val lie Ser Val Lys Thr Ser Gly Ser Lys Glu Arg 

575 580 585 

Ala Ser Lys Ser Gin Asp Arg Lys Ser Ala Ser Arg Glu Lys Arg 
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590 595 600 

Ser Val Val Ser Phe Asp Lys Val Lys Glu Pro Arg Lys Ser Arg 

605 610 615 

Asp Ser Glu Ser His Ser Arg Val Arg Glu Arg Ser Glu Arg Glu 

620 625 630 

Gin Arg Met Gin Ala Gin Trp Glu Arg Glu Glu Arg Glu Arg Leu 

635 640 645 

Glu lie Ala Arg Glu Arg Leu Ala Phe Gin Arg Gin Arg Leu Glu 

650 655 660 

Arg Glu Arg Met Glu Arg Glu Arg Leu Glu Arg Glu Arg Met His 

665 670 f 675 

Val Glu His Asp Gly Arg Arg Glu Gin Glu Arg lie His Arg Glu 

680 685 690 

Arg Glu Glu Leu Arg Arg Gin Gin Glu Leu Arg Tyr Glu Gin Glu 

695 700 705 

Arg Arg Pro Ala Val Arg Arg Pro Tyr Asp Leu Asp Arg Arg Asp 

710 715 720 

Asp Ala Tyr Trp Pro Glu Ala Lys Arg Ala Ala Leu Asp Glu Arg 

725 730 735 

Tyr His Ser Asp Phe Asn Arg Gin Asp Arg Phe His Asp Phe Asp 

740 745 750 

His Arg Asp Arg Gly Arg Tyr Pro Asp His Ser Val Asp Arg Arg 

755 760 765 

Glu Gly Ser Arg Ser Met Met Gly Glu Arg Glu Gly Gin His Tyr 

770 775 780 

Pro Glu Arg His Gly Gly Pro Glu Arg His Gly Gly Ala Ser Arg 

785 790 795 

Asp Gly Trp Gly Gly Tyr Gly Ser Asp Lys Arg Met Ser Glu Gly 

800 805 810 

Arg Gly Leu Pro Pro Pro Pro Arg Gly Arg Arg Asp Trp Gly Asp 

815 820 825 

His Gly Arg Arg Glu Asp Asp Arg Ser Trp Gin Gly Thr Ala Asp 

830 835 840 

Gly Gly Met Met Asp Arg Asp His Lys Arg Trp Gin Gly Gly Glu 

845 850 855 

Arg Ser Met Ser Gly His Ser Gly Pro Gly His Met Met Asn Arg 

860 865 870 

Gly Gly Met Ser Gly Arg Gly Ser Phe Ala Pro Gly Gly Ala Ser 

875 880 885 

Arg Gly His Pro lie Pro His Gly Gly Met Gin Gly Gly Phe Gly 

890 895 900 

Gly Gin Ser Arg Gly Ser Arg Pro Ser Asp Ala Arg Phe Thr Arg 

905 910 915 

Arg Tyr 



<210> 31 

<211> 392 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte Clone No.: 2181282 
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<400> 31 

Met Pro Lys Pro lie Met Val lie Pro Thr Leu Ala Ser Leu Ala 

15 10 15 

Ser Pro Thr Thr Leu Gin Ser Gin Met Leu Gly Gly Leu Gly Gin 

20 25 30 

Asp Val Leu Leu Asn Asn Ser Leu Thr Pro Lys Tyr Leu Gly Cys 

35 40 45 

Lys Gin Asp Asn Ser Ser Ser Pro Lys Pro Ser Ser Val Phe Arg 

50 55 60 

Asn Gly Phe Ser Gly lie Lys Lys Pro Trp His Arg Cys His Val 

65 70 75 

Cys Asn His His Phe Gin Phe Lys Gin His Leu Arg Asp His Met 

80 85 90 

Asn Thr His Thr Asn Arg Arg Pro Tyr Ser Cys Arg lie Cys Arg 

95 100 105 

Lys Ser Tyr Val Arg Pro Gly Ser Leu Ser Thr His Met Lys Leu 

110 115 120 

His His Gly Glu Asn Arg Leu Lys Lys Leu Met Cys Cys Glu Phe 

125 130 135 

Cys Ala Lys Val Phe Gly His lie Arg Val Tyr Phe Gly His Leu 

140 145 150 

Lys Glu Val His Arg Val Val lie Ser Thr Glu Pro Ala Pro Ser 

155 160 165 

Glu Leu Gin Pro Gly Asp lie Pro Lys Asn Arg Asp Met Ser Val 

170 175 180 

Arg Gly Met Glu Gly Ser Leu Glu Arg Glu Asn Lys Ser Asn Leu 

185 190 195 

Glu Glu Asp Phe Leu Leu Asn Gin Ala Asp Glu Val Lys Leu Gin 

200 205 210 

lie Lys Cys Gly Arg Cys Gin lie Thr Ala Gin Ser Phe Ala Glu 

215 220 225 

lie Lys Phe His Leu Leu Asp Val His Gly Glu Glu lie Glu Gly 

230 235 240 

Arg Leu Gin Glu Gly Thr Phe Pro Gly Ser Lys Gly Thr Gin Glu 

245 250 255 

Glu Leu Val Gin His Ala Ser Pro Asp Trp Lys Arg His Pro Glu 

260 265 270 

Arg Gly Lys Pro Glu Lys Val His Ser Ser Ser Glu Glu Ser His 

275 280 285 

Ala Cys Pro Arg Leu Lys Arg Gin Leu His Leu His Gin Asn Gly 

290 295 300 
Val Glu Met Leu Met Glu Asn Glu Gly Pro Gin Ser Gly Thr Asn 

305 310 315 

Lys Pro Arg Glu Thr Cys Gin Gly Pro Glu Cys Pro Gly Leu His 

320 325 330 
Thr Phe Leu Leu Trp Ser His Ser Gly Phe Asn Cys Leu Leu Cys 

335 340 345 

Ala Glu Met Leu Gly Arg Lys Glu Asp Leu Leu His His Trp Lys 

350 355 360 

His Gin His Asn Cys Glu Asp Pro Ser Lys Leu Trp Ala lie Leu 

365 370 375 

Asn Thr Val Ser Asn Gin Gly Val lie Glu Leu Ser Ser Glu Ala 

380 385 390 

Glu Lys 
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<210> 32 

<211> 1566 

<212> DNA 

<213> Homo sapiens 



<220> 

<22l> mi. sc_ feature 
<223> Incvte Clone No. 



591290 



<400> 32 

tggttcagaa 

ggcggcggcg 

agaggcaatg 

caagacttaa 

tttacatgca 

gtttgcagaa 

gactataaaa 

gaaggtaaaa 

ttaataaaga 

tttgacaacg 

gttcataatg 

gaaccagtca 

gaaatctcca 

gtgaagaaga 

aaagaactaa 

aaaaagggac 

tctgcaggga 

gcacgcgtgg 

aagaaaaaga 

cctgcaaaag 

gacaatgaaa 

acagatgagc 

aagaacccta 

tatttaaaaa 

taatgaattc 

atatttttgg 

aaaaaa 



gcgaattctg 
agtccacgtg 
tttgctgtct 
aacctgtctt 
atgcatgtgg 
actgtgaatg 
accacgtgaa 
cccacaaagg 
gacccaatgt 
ttcccaggaa 
aatccattct 
ataaggaaca 
ccaaggttcc 
ataaaagaga 
agttagaaaa 
aggaggctga 
agaggagcaa 
gcgcagggaa 
tgaagctccc 
gtaaattcaa 
taaccatcaa 
atcacagatc 
cctttaagtt 
ttgaatccat 
taacaactca 
tataactttt 



caccgcgaac 
ctccccgcgg 
tccattggag 
ggatagagaa 
tgaatcagtg 
cctttcttgc 
atgcataagt 
cgacatcaaa 
cagccccaaa 
aaaggcaaaa 
ggaccaggtg 
ggatcaacgg 
agcctccaaa 
aaaaaaggaa 
ccaccaggaa 
ccttgaggct 
gaagaagaag 
gaggaagcgg 
agagcatcct 
ctggaaggga 
aaagctaagg 
cgaagaggaa 
attaaaggac 
tctgctgact 
aattttgctt 
atgagaaaaa 



ctgagaggca 
ccggttgaaa 
tgactgaatt 
tatttagcca 
aagaaaatac 
attgactgcg 
gaagatcaga 
cagcaggcgt 
gtgagagaac 
tttcagaatt 
tggaatatct 
ccactccacc 
gtgaaagacg 
gaacggcaga 
aactcaagga 

ggtggggagg 

cagcgcaagg 
aggcactcgg 
gagggcggag 
actattaaag 
aaaaaggttt 
ctcctggtca 
aaagtcaagc 
tcttcctttc 
tttgaagctg 
taaaatatat 



ggtagctgcg 
ccgttggcgg 
tctacatgac 
tttacctaaa 
aagtggaaaa 
gtaaagattt 
agtatggtgg 
ggattcagaa 
ttttagagca 
ggatgaagaa 
tttctgaagc 
cagtggcaaa 
ccgtggaaca 
agaaaaggaa 
atcagaagcc 
aagtccctga 
acagcgccag 
aagttgaaac 
aaccagaaga 
caattctgaa 
tagctcagta 
tctttaacaa 
ttgtgaaatg 
actgctgttt 
tatttttaag 
tctggtccaa 



gcgctgggct 
gcgctggctg 
ggctttttga 
aatggtattt 
gcatgtgtct 
ctggggcgat 
caaaggctat 
aattagtgaa 
aattagtgct 
cagtttaaaa 
ttccaacagc 
tccacatgca 
gcaaggggag 
aagagaaaag 
taagaagcgc 
ggccaatggc 
tgaggaagag 
agattctaag 
cgatgaggct 
acaggcccca 
ctacacagtg 
gaaaatcagc 
aacatttgtg 
ataaaatgtg 
ttaagaaaat 
acttctaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1566 



<210> 33 
<211> 2338 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 815856 

<400> 33 

aggggaggtt ctccttttgc tattgtcatc acccagcaac accagattca ccgctcctgc 6 0 
acagtcaaca tcatgtttgg aaccccgcaa gagcatcgca acatgcccca agcagatgcc 120 
atggtgctgg tggccagaaa tetatgagcgt taeaagaatg agtgccggga gaaggaacgt 18 0 
gaggagattg ccagacaggc agccaagatg gccgatgaag ccatcctgca ggaaagagag 24 0 
agaggaggcc ctgaggaggg agtgcgtggg ggccaccctc cagccatcca gagcctcatc 3 00 
aacctgctgg cagacaacag gtacctcact gctgaagaga ctgacaagat catcaactac 360 
ctgcgagagc ggaaggagcg gctgatgagg agcagcaccg actctctgcc tggtgagcta 420 
cgtggcaggg ccgaggcccg atttcccgcc aaccactcgg ggcgacctcg ggtgcctcgc 4 80 
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tgaagacaca 
ccactccatc 
tcaatagtgg 
acaccccaaa 
cttctggcaa 
ccagacctgg 
ctagcaacat 
gtgtacagaa 
gccagaccac 
actgaagcta 
cttgttctaa 
tcatcaattt 
aagtttatac 
atctccgggg 
ttctactgga 
gtagccacca 
ccagagctca 
tggcaggcac 
taaatcagcc 
tctctctgag 
tggaagatct 
ctgggcccca 
cttccatgaa 
ttgggctttt 
tgagcagcag 
gagatgggtg 
ggctgtggtt 
gagccctgtg 
agctgagtcc 
gcagcatttt 
actcttgtgt 



gccaagctcc 
tgcacccccc 
cacagtgacg 
ccagaatttt 
tcagcctcca 
ggctccttcc 
gactagccag 
ggctctggat 
agcacagatg 
aatctttcaa 
atagagctgt 
ctaccacctg 
tgcatttggg 
gcagtacagg 
tttggagcca 
cctttgttca 
agctcctgcc 
agtgtcatca 
actctggatt 
ccaaggttgt 
ccaatggact 
ccactcttcc 
gttggttctg 
ggctcaaaac 
gaggtaggga 
gctgttcgcg 
ctaaatggag 
gatcacagtc 
ctgccagtgg 
atgttctgac 
tgttaataaa 



caaccgctcc 
acctcccagc 
gccaatagca 
tccacagcag 
agcattttgg 
caagggcttt 
aggcctgtgt 
accctgatcc 
gggcagccac 
ctctccccag 
ttggaggatg 
ctctctcttc 
gctgtatctt 
tgtctgggct 
ggcccagcta 
ttgtgaacct 
tgcaactcct 
taccctcacc 
gttgaggaat 
tttttgtttt 
gaacagctcc 
ctgttaacca 
agactctggc 
cccaggctct 
ccactttgat 
acttttgtac 
ccccaggaag 
ttgaggtcct 
ttaagcaaac 
ctgtttgtgt 
atgaaatgat 



agagcggcca 
aagagcttca 
gctctgcatc 
caaacagcca 
gacagggagg 
ttggccagcc 
cttccacagg 
agagtggccc 
aggcccccat 
tcccctcatc 
ttctctgcgc 
tgcccaaggc 
tttttgtttt 
tgtatttgat 
gccaagtttg 
accaaggctt 
gcctagagtt 
ccatatgttt 
ggccatggca 
tttctctttt 
agtcagcagc 
gttctgtcag 
cgtgaacacc 
tgtttttgtc 
gtcagacttc 
cagagtgaaa 
ctgccctctc 
ctagcagggg 
aatggtttca 
tatatagtgg 
tactttttaa 



agtgctcccc 
ggccaaaat c 
cccctcggtt 
gcc.tcaacaa 
atctgctcag 
ttccagtcgc 
tatcaacttt 
tgctctctcc 
gggatcttac 
ccctggcctc 
tcccaggccg 
tgtgttgctt 
ttgttttgta 
ggggtttctc 
gaatggcatt 
tccagcttca 
gaagaaaagc 
ggggtctgct 
gccacagaaa 
cttttttgtt 
agttaccaca 
catccccctc 
cgtttcttca 
tagactctta 
tggtagctgg 
ttgttagaag 
cccagggttt 
tgagggagag 
aaattcaagg 
tttttttttt 
ttaaaatgaa 



tctgctacac 
ctcagcctct 
gctgccggaa 
agatcacagg 
aacatgggcc 
ctggcacctg 
gacaatccaa 
cacctggtta 
cagaggcatt 
ctcccactta 
gcatcgagtg 
attccttaca 
gaaaataaat 
cggtccctgt 
tgtcatgtca 
tacacattga 
aaactggcct 
tgaggattca 
aaagaatttt 
ttcatttcat 
aactgtgaat 
tccagcagca 
gtgatttgtt 
ttctgtttcc 
acatgttctc 
gagggtttct 
gtgctcagta 
cagcgacttc 
tccccaaatg 
cctctttgga 
aaaaaaaa 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2338 



<210> 34 
<211> 870 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte Clone No. 



996352 



<400> 34 

aaaaccgaca 

attatgtgaa 

agccgacttc 

gcgccttcgc 

cctttgcgtc 

acgaccaaaa 

acgagggcgc 

cttcaggacc 

tgg^ga.agga 

acgcgctctg 

ggaagaaaga 

cctctgccgc 

aggaagacga 

aagcgtcttt 



ctgaaagctg 
cgcgcttttt 
ttgctccttt 
ctcgaccgtt 
cggctacggc 
catccccggg 
gctgcatggc 
tgacaggaag 
gttccacttc 
cctcaccgag 
gcataaggac 
cgccactgct 
ggaggaatga 
aagagactca 



ccggacaaca 
agcaaatata 
gctcccaact 
ccgggcttat 
ctgggcgccg 
ctctgcagtg 
gcggctgagg 
cggggccgcc 
aaccgctacc 
cgccagatta 
gaaggtccga 
gccgcggaca 

ggggccgatc 

ctggttttac 



aatcacaggt 
cggcgggggc 
cacagagaag 
acaatgtcaa 
acgcctacgg 
acctcgccaa 
ccaatttccg 
agacctacac 
tgatccggcg 
agatctggtt 
ctgccgccgc 
aggccgacga 
cggggccctc 
ttacaaaaat 



caaaattatg 
ttctctgttc 
cggctacggg 
cagccccctt 
caacctgccc 
aggcgcctgc 
catctacccc 
gcgctaccag 
ccgccgcatt 
ccagaaccgc 
agctcccgag 
ggaggacgat 
tctgcaccgg 
gggaaaaata 



agttcttcgt 
caaaatgccg 
gcgggcgccg 
tatcagagcc 
tgcgcctcct 
gacaagacgg 
tggatgcggt 
acgctggagc 
gaaatcgccc 
cgcatgaagt 
ggcgccgtgc 
gatgaagaag 
acagtcggaa 
aaagaaaatg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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870 



<210> 35 

<211> 1365 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> unsure 

<222> 6, 61, 882, 883, 884, 885, 886, 887, 888, 889, 890, 891, 892, 893, 
8 94, 8 95, <221> unsure 

<222> 896, 897, 898, 899, 900, 901, 902, 903, 904, 905, 906, 907, 908, 912, 
914, 916, <221> unsure 

<222> 920, 922, 929, 932, 964, 967, 971, 975, 986, 1000, 1028, 1033, 1039, 
1052, <221> unsure 

<222> 1053, 1063, 1065, 1071, 1080, 1090, 1100, 1102, 1110, 1113, 1123, 
1128, <221> unsure 

<222> 1134, 1139, 1142, 1143, 1144, 1153, 1158, 1164, 1169, 1177, 1180, 
1181, <221> unsure 

<222> 1182, 1184, 1189, 1191, 1192, 1211, 1224, 1239, 1246, 1247, 1248, 
1263, <221> unsure 

<222> 1266, 1274, 1284, 1291, 1314, 1321, 1322, 1339, 1341, 1343, 1347, 
1351, <221> unsure 

<222> 1353, 1359, 1362, 1363, 1364 

<22 3> a or g or c or t, unknown, or other 



<220> 

<221> misc_f eature 
<223> Incyte Clone No. 



1273778 



<400> 35 

cttcangcct 

ngtcggccgc 

ctcggggcgc 

cgacggcggg 

atttcctggc 

gcactgttgt 

tatgtcgtca 

atctctaaga 

gtctcaggtg 

agattctgct 

ctctgtaggt 

gccctggatg 

tcctgcttct 

aatatttgag 

taattccact 

nnnnnnnngg 

acanggnatt 

tggaaaangt 

tccagaatgn 

gnnngaaaca 

agaggaattt 

aanacntttg 

nnccttcccc 



ggcggcggct 
cgttacccgc 
ctgcgggacg 
gctaaggccc 
cttcctgttg 
gagagcgaag 
tgaagccggc 
taagaacctg 
acatttagtg 
cagagggact 
ctttccgtaa 
atggagaaaa 
gtgaactttg 
ttgtcagaac 
gtagaagctt 
cngngnttgn 
ntcancaaag 
tgnaaaacna 
gacacgtttn 
aanttctncc 
nttccaaaag 
aaantcgaag 
ggtctgttnt 



tccaagctaa 
tcaccagcta 
gtgaggccct 
tgggtccgcg 
agccgtgtaa 
tgggcgcgag 
gctttcagtt 
gaaaggggac 
atgtggctat 
tatacaagga 
ctaagccata 
aactgtcaaa 
gattctctgc 
tatgtgtttt 
ttttcaagaa 
cnttcttgnt 
aagganattt 
agttttgaat 
gnaagagccn 
tgtnggggna 
ggtnccggtc 
aaangggggg 
ngngggnaag 



ggaacggttt 
cgcggcgcgt 
gctgaggact 
cgcggtttga 
acgcggggtg 
agcagacgcc 
gtgcaacctt 
tctgttggcc 
agacttctct 
tgtgatggtc 
tgtgatcatg 
agcttaccca 
tctatttgag 
ctgggtgctt 
gtaaaaaaaa 
anatttcaga 
gtgatgaagn 
tnncaaaatt 
ggnattccaa 
tttcccncen 
ataaaaggng 
ncggccttcc 
ntntttacng 



ggggcagtgt 
caggtccgcg 
ccggcaagtg 
ccacggccgg 
atgacggcgc 
agctacagtt 
gaacaaatgg 
attggaaatt 
catgaagagt 
cagaattatg 
ttattggagg 
tttcctttat 
cattgttcag 
catttcttat 
annnnnnnnn 
ttgggaatca 
ttcaccccaa 
ctnantaagg 
agngtacnct 
nnantatgnt 
gggggnnnaa 
aacccggggc 
gnnng 



cgttcccgga 
gaggcgcggg 
tgggtcgcgg 
ggccttgggc 
cgacctcttg 
ttttttgggt 
gacactgccc 
gcagaataat 
gggcatgcct 
agaacctggt 
atggaaaaga 
cacactctgt 
aagtcactga 
ccaattctcc 
nnnnnnnnnn 
agatgggaaa 
ccagtaacaa 
ntttggaatn 
ttcntgggnc 
nnccaggttg 
tggggggaaa 
caancccgta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1365 



42/66 



WO 99/64596 



PCT/US99/13281 



<210> 36 

<211> 2402 

<212> DNA 

<213 > Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte Clone No.: 1509715 

<400> 36 

agcgcgggac aaagggaagc gaagccggag ctgcgggcgc tttttctgcc cgcggtgtct 6 0 
cagattcatt cttaaggaac tgagaactta atcttccaaa atgtcaaaaa gaccatctta 12 0 
tgccccacct cccaccccag ctcctgcaac acaaatgccc agcacaccag ggtttgtggg 180 
atacaatcca tacagtcatc tcgcctacaa caactacagg ctgggagggg aacccgggca 240 
ccaacagccg ggtcacggca tcctctggta tcacgattcc aaaaccccca aagccaccag 300 
ataagccgct gatgccctac atgaggtaca gcagaaaggt ctgggaccaa gtaaaggctt 360 
ccaaccctga cctaaagttg tgggagattg gcaagattat tggtggcatg tggcgagatc 42 0 
tcactgatga agaaaaacaa gaatatttaa acgaatacga agcagaaaag atagagtaca 480 
atgaatctat gaaggcctat cataattccc ccgcgtacct tgcttacata aatgcaaaaa 54 0 
gtcgtgcaga agctgcttta gaggaagaaa gtcgacagag acaatctcgc atggagaaag 600 
gagaaccgta catgagcatt cagcctgctg aagatccaga tgattatgat gatggctttt 66 0 
caatgaagca tacagccacc gcccgtttcc agagaaacca ccgcctcatc agtgaaattc 72 0 
ttagtgagag tgtggtgcca gacgttcggt cagttgtcac aacagctaga atgcaggtcc 780 
tcaaacggca ggtccagtcc ttaatggttc atcagcgaaa actagaagct gaacttcttc 840 
aaatagagga acgacaccag gagaagaaga ggaaattcct ggaaagcaca gattcattta 900 
acaatgaact taaaaggttg tgcggtctga aagtagaagt ggatatggag aaaattgcag 96 0 
ctgagattgc acaggcagag gaacaggccc gcaaaaggca ggaggaaagg gagaaggagg 102 0 
ccgcagagca agctgagcgc agtcagagca gcatcgttcc tgaggaagaa caagcagcta 108 0 
acaaaggcga ggagaagaaa gacgacgaga acattccgat ggagacagag gagacacacc 114 0 
ttgaagaaac aacagagagc caacagaatg gtgaagaagg cacgtctact cctgaggaca 1200 
aggagagtgg gcaggagggg gtcgacagta tggcagagga aggaaccagt gatagtaaca 126 0 
ctggctcgga gagcaacagt gcaacagtgg aggagccacc aacagatccc ataccagaag 1320 
atgagaaaaa agaataagtg ttgccttgtt ttgtgtgttc taaatacttt ttttaatgaa 1380 
aaaatgtttt ttggttttaa tggtgttacg tggtttgtgt attaattttt tttcttgtcc 1440 
atatcacacc accaaaggct tttggaccat ttagcatcat gagcctaatg gctcagtcag 1500 
tcacctttct taagtgttgt gaagatggct cttttctttg gatcttgttt ctagccctca 1560 
actgctgaaa gcctcagaat ttagattaat tgagaaaaca cccacctctt ttagagaatt 1620 
atcctttgat gctgcagaat ctactcttac aatgccttcc tacagctcac tggggtgctt 1680 
accaaagcca tagctttaaa ccttcccagt ccccatcagt agcttcctga aagtctcctc 1740 
tcttgtttac ttctgcaaag ggtagcttct taaaaacgtg atcatgtatg agtatgtatt 1800 
tgttcactta ccctttttta cttttaatca atgtcagata ccaagagttg tgttaagctg 1860 
agtgtagtgt gtaactaact acacttggat cttactgatc cagaaatagt ccccatagtt 1920 
agagtagtta cttatgaagt ggttattaaa gtgaacacag cacatataca ttatctatac 198 0 
tgctttttgt tatgattaat actgggtatg ttctggtaaa tccatcctta ttgtatagaa 2040 
aaaaaattac ttttttacca ggttttccaa agacagaata gatcacaaag ctcaaggaat 2100 
ttaatattct tgtaatggac tagataattc aaactgatta gcccattcca gaagaaaaac 216 0 
agctgggaat taagttaatc cacttgaaat tgttttacaa taatcagaac atccaaacct 2220 
caaggctcag gatcccatag accagagccc acctttttga taaacttagt aaagtcttgg 2280 
agactagaag caagatagtt tgtgacacat aagcttccca aaaactagaa tagattttta 234 0 
ctgaatagtg gtatatctga tggtatatgt ttcttaaagg tccaaatgta ataaaaaaaa 2400 
aaaaaaaaaa aaaaaaaa 
2418 



<210> 37 
<211> 866 
<212> DNA 
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<213> Homo sapiens 
<220> 

<22l> unsure 

<222> 684, 698, 740, 755, 757, 776, 796, 805, 811, 824, 850, 856 
<223> a or g or c or t, unknown, or other 

<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 1676367 

<400> 37 

gccggggtgc gcacttgggc ccccctggcc atggcggcga aggtggacct gagcacctcc 60 
accgactgga aggaggcgaa atcctttctg aagggcctga gtgacaagca gcgggaggaa 12 0 
cattacttct gcaaggactt tgtcaggctg aagaagatcc cgacatggaa ggagatggcg 18 0 
aaaggggtgg ctgtgaaggt ggaggagccc aggtataaaa aggacaagca gctcaatgag 24 0 
aaaatctccc tgctccgcag cgacatcacc aagctggagg tggacgccat cgtcaacgcc 3 00 
gccaacagct ccctgctcgg aggcggtggc gtggacggct gcattcatcg ggccgccggc 36 0 
cccctgctta ccgacgagtg ccggaccctg cagagctgta agactggcaa ggccaagatc 42 0 
accggcggct atcggctccc ggccaagtac gtcatccaca cagtggggcc catcgcctac 4 80 
ggggagccca gcgccagtca ggctgccgag ctccgcagct gctacctgag cagtctggac 540 
ctgctgctgg agcaccggct ccgctcggtg gcgttcccct gcatctccac cggcgtgttt 6 00 
ggctacccct gtgaggcggc cgccgagatc gtgctggcca cgctgcgaga gtggcttggg 660 
agcagcacaa gggaaccccg gggnaattta aatttccngg aacccgggta actttgcaag 72 0 
gcggttaacc aagctttttn cccctaataa attgnanttc cgtaatttaa gaagcntttg 780 
gggggtaaaa tcaatngggt caatnaagct nggttttccc cggnggttga aaaatttggt 84 0 
taatcccggn ttaaanaaat ttccca 866 



<210> 38 

<211> 1651 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_f eature 

<223> Incyte Clone No.: 1734119 



<400> 38 

ggaaggtggt 

cccgaggggg 

gggggcccct 

gggaacgagg 

cagcgccctg 

ttcgcccgcc 

gaaacagccc 

tcaggcctgc 

gcggggagcc 

tcagccccat 

ttctgtctca 

ccagaagccc 

cacccctggc 

ttgtggcatc 

gtcgcccagc 

ggcgtgccaa 

tgggtctgag 

agggcagggg 



ggcaggcacc 
agcccgccag 
tgaggcactg 
ggactgtgtt 
tggttccggg 
gccacaccgg 
caaaatacga 
cctggcctcc 
ccgtggaagg 
ggggagacgg 
cccggaagcg 
cccatggcga 
agggccgcta 
tctgagggcc 
acgcaggccg 
gcatcccaga 
tgcctgatcc 
tgcctgcccc 



ggcctgggca 
gccacgccgc 
aggcaccgag 
tggggaggtg 
cttcgcacag 
gaccctgcac 
ccactccaac 
tcgtccgcag 
gtcaggggag 
ggctggcggg 
tccttttttt 
gtgctggggc 
caactttttc 
cagattgcac 
gggcgctgcg 
gccgggctgg 
cctgcccccc 
aggagagtcc 



gcttccaggc 
actgagccac 
actggttctc 
gctttttcgt 
ctgtcccagg 
ggctgcttct 
cagcagttcc 
ggctgtctgc 
accaggtcag 
ataccgcccc 
gtgccaggtg 
ccggcggtgc 
cagcagcgga 
aaggtgacct 
gggctaagta 
gaccgccaaa 
aaaaaaagca 
caggcagtgg 



ctggctggcc 
agccccgggg 
cccgagagac 
ctgctgttga 
gatgggtcgc 
ggcctcgaca 
cgcctgcctg 
tggcttctgg 
ggcagctaca 
ccccggcttc 
tctacctaag 
cctgggggag 
gccctctggg 
ggccgtggcc 
ttaggccttc 
acgtcgtggc 
gaggtaggtg 
ttctcgtgcc 



acgaccacgg 
gccgcctccc 
tcggaaggtg 
ctgaacacta 
ctgtgctgcc 
gatgacaaaa 
cccgccactg 
gggcagaaga 
tttctggtga 
cccacaccac 
agggttggtg 
cagatggggc 
gggcctgtgc 
tgagggtgga 
ccagggaggg 
ctggatcctc 
ttgcaggccc 
agtggcaccc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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aggggcaagg acagccaacc cccacccttg ccacgtgtgg ggccacgtgg gcatgtgggg 114 0 

tgtgtgtttt taccttggtg aatctcacct gccaacgatt tctcgtgagt gccgaccacc 1200 

ttctccgacc atgttacgcc cgggcggcag cagcccccgg ccactgcaaa cccatgccct 126 0 

gggtccctcg gctcccccag ggaggcatcc ccgtgccaat gtcccccagt ggtggcagca 132 0 

gatcctgtgg ccggcctggc ggacgggacc cagtgatact tgtatattac acagtcctga 13 80 

tttcagacaa tttcaacctt aatctattta aaaaagaata ttctatacaa gctgttttta 144 0 

agccttttac catttgaaat gcatgtgttg tgcgcgttgg ggatgggagg aggggctgag 1500 

gagcggctca gtgtcacctc ccacagccac cggccctgac ccttaatcca gacaccgatg 1560 

gaagtcgact tttcatatct ttctcctgaa atgaactctg ttttaaattg gaataaattt 1620 

tgttcctaaa aaaaaaaaaa aaaaaaaaaa a 1651 



<210> 39 

<211> 1032 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte Clone No. : 1944813 



<400> 39 

tattaatgaa 

tagaaatgtt 

agttggcttt 

ttgttgccgc 

aaggtgattg 

ctgctcgtat 

ggtgggcaat 

acatggtgaa 

taaggatcca 

gagtttcatg 

atgattccat 

tcagaagctt 

ttgcaagagg 

aatgcatctg 

atcaacattt 

atgctctgtc 

tagacttccc 

atattttgta 



gagtgactgc 
tcattctgga 
taggacacta 
tgattttgtc 
cccattgatt 
agtctgtcct 
ggcagaaggt 
acaagtaaga 
tgatgacctt 
gattacagtc 
taaaataatt 
aaaggaagca 
actcttagca 
ggactgggtt 
aatgtacatg 
aagcacatca 
aaataatttt 
cc 



atgcaaacga 
cagctggtaa 
gtaagaaaat 
aaatctataa 
gttcagtttg 
tatgcgaatg 
tatggggctt 
aatcaagtgg 
aaaagaactg 
catggaagaa 
aaggaaaata 
gaaaatgtgt 
aacccggcca 
gacattgctc 
atggaaaaga 
gcaatcatag 
aatatacact 



caatatgtca 
aagtctgtgc 
atagttgtga 
aagccagaga 
ctgctaacga 
gaatagacat 
gcttaataaa 
aaacccctgg 
tagatctttg 
ctgctgaaga 
tgtctatacc 

ggcggattac 

tgtttgctgg 
ttgaactcgg 
taacttcaag 
attaccttac 
tttagaccca 



ggaaagaaaa 
cccaatggtt 
tctgtgttac 
cagcgaattt 
tgcaagact t 
taactgtggt 
caagccagag 
attttcagtt 
tcaaaaggct 
aagacatcag 
tgtaattgct 
tgggacagat 
atatgaggaa 
gactccttac 
gcaggaaaaa 
agaccattat 
cagtgaaacc 



aaagatccca 
cgatattcaa 
acaccaatga 
accacaaatc 
ttatctgatg 
tgccctcaga 
cttgttcaag 
tctattaaaa 
gaagcaacag 
ccagtgcact 
aatggagaca 
ggtgtgatgg 
accccactga 
atgtgtttcc 
agggtattta 
ggcatttgac 
acagaaggtc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1032 



<210> 40 

<211> 1797 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 2683322 



<400> 40 

tfcgtagtggg gctggagcag agcctgccgc 
atccctcgaa tccaccagca cgagcgtccc 
atgggccctg gcggtggacg agcaggaagc 
taaggaagag aaaatcaaac cagataccaa 
agagaagaca gatgaagaag agaaagagga 



gaacccccgg agcccacgat ccctcgtgcc 6 0 
acccgcgcct gggaccatgg ccactgactc 120 
tgcggctgag tcgttgagca acttgcatct 180 
tggtgctgtt gtcaagacca atgccaatgc 240 
cagagctgcc cagtccttac tcaacaagct 3 00 
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gatcagaagc aaccttgttg ataacacaaa ccaagtggaa gtcctgcagc gggatccaaa 36 0 
ctcccctctg tactcggtga agtcttttga agagcttcgg ctgaaaccac agcttctcca 420 
gggagtctat gccatgggct tcaatcgacc ctccaagata caagagaacg cattacccat 480 
gatgcttgct gaacccccac agaatctgat tgcccagtct cagtctggca ctggtaaaac 540 
agctgccttt gtcttagcca tgctcagccg agtggagcca tcagacagat acccccagtg 6 00 
tctgtgcctc tccccaacat atgagctggc gcttcaaaca ggaaaagtga ttgagcagat 66 0 
gggcaaattt tacccagaac tgaagcttgc ctatgccgtt cgaggcaata aattggaaag 72 0 
aggccagaag atcagtgagc agattgtcat tggcacccct gggactgtgc tggactggtg 78 0 
ctccaagctc aagttcattg atcccaagaa aatcaaggtg tttgttctgg atgaggctga 840 
tgtcatgata gccactcagg gccaccaaga tcagagcatc cgcatccaga ggatgctgcc 900 
caggaactgc cagatgctgc ttttctccgc cacctttgaa gactctgtgt ggaagtttgc 960 
ccagaaagtg gtcccagacc caaacgttat caaactgaag cgtgaggaag agaccctgga 102 0 
caccatcaag cagtactatg tcctgtgcag cagcagagac gagaagttcc aggccttgtg 1080 
taacctctac ggggccatca ccattgctca agccatgatc ttctgccata ctcgcaaaac 1140 
agctagttgg ctggcagcag agctctcaaa agaaggccac caggtggctc tgctgagtgg 12 00 
ggagatgatg gtggagcaga gggctgcagt gattgagcgc ttccgagagg gcaaagagaa 126 0 
ggttttggtg accaccaacg tgtgtgcccg cggcattgat gttgaacaag tgtctgtcgt 132 0 
catcaacttt gatcttcccg tggacaagga cgggaatcct gacaatgaga cctacctgca 1380 
ccggatcggg cgcacgggcc gctttggcaa gaggggcctg gcagcgaaca tggtggacag 144 0 
caagcacagc atgaacatcc tgaacagaat ccaggagcat tttaataaga agatagaaag 1500 
attggacaca gatgatttgg acgagattga gaaaatagcc aactgagaag ctccaccagc 1560 
caccgatgcc agccctggca ctgcccctgc acaggagaca agtgcgttca gggcacaggc 162 0 
cccgacatca ccccaaggac aacggcacaa gtagagagaa actacctacc tcacttcaaa 1680 
ttatgtttgg acttgacaaa aatgtatgca aatgatgggg gatggtagaa aaaaattatt 1740 
tacacaacct tggaagatta ggcatgaata cacagagatt tacctttaaa aaaaaaa 1797 



<210> 41 
<211> 1987 
<212> DNA 

<213> Homo sapiens 
<220> 

<22l> misc_feature 

<223> Incyte Clone No. : 2684552 
<400> 41 

tgacagagga ggctccgtgt ctgcagctag tgtgtcaact cagcgtttct cctctcgtcc 6 0 

ctggaagagc taaagatggc tgaatttcta gatgaccagg aaactcgact gtgtgacaac 12 0 

tgcaaaaaag aaattcctgt gtttaacttt accatccatg agatccactg tcaaaggaac 180 

attggtatgt gtcctacctg taaggaacca tttcccaaat ctgacatgga gactcacatg 240 

gctgcagaac actgtcaggt gacctgcaaa tgtaacaaga agttggagaa gaggctgtta 300 

aagaagcatg aggagactga gtgccctttg cggcttgctg tctgccagca ctgtgattta 36 0 

gaactttcca ttctcaaact gaaggaacat gaagattatt gtggtgcccg gacggaacta 42 0 

tgtggcaact gtggtcgcaa tgtccttgtg aaagatctga agactcaccc tgaagtttgt 480 

gggagagagg gggaggaaaa gagaaatgag gttgccatac ctcctaatgc atatgatgaa 54 0 

tcttggggtzc aggatggaat ctggattgca tcccaactcc tcagacaaat tgaagctctg 6 00 

gacccaccca tgaggctgcc gcgaaggccc ctgagagcct ttgaatcaga tgttttccac 66 0 

aatagaacta ccaaccaaag gaacattaca gcccaggttt caattcagaa taatctgttt 720 

gaagaacaag agaggcagga aaggaataga ggccaacagc cccccaaaga gggtggtgaa 78 0 

gagagtgcaa acttggactt catgttggcc ctaagtctgc aaaatgaagg ccaagcctcc 84 0 

agtgtggcag agcaggactt ctggagggcc gtatgtgagg ccgaccagtc tcafcggcggt 900 

cccaggtctc tcagtgacat aaagggtgca gctgacgaga tcatgttgcc ttgtgaattt 96 0 

tgtgaggagc tctacccaga ggaactgctg attgaccatc agacaagctg taacccttca 102 0 

cgtgccttac cttcactcaa tactggcagc tcttccccca gaggggtgga ggaacctgat 1080 

gtcatcttcc agaacttctt gcaacaggct gcaagtaacc agttagactc tttgatgggc 1140 

ctgagcaatt cacaccctgt ggaggagagc atcattatcc catgtgaatt ctgtggggta 12 00 



46/66 



WO 99/64596 



PCTYUS99/13281 



cagctggaag aggaggtgct gttccatcac caggaccagt gtgaccaacg cccagccact 126 0 

gcaaccaacc atgtgacaga ggggattcct agactggatt cccagcctca agagacctca 1320 

ccagagctgc ccaggaggcg tgtcagacac cagggagacc tgtctfcctgg ttacctggat 13 80 

gatactaagc aggaaacagc taatgggccc acctcctgtc tgcctcccag ccgacccatt 1440 

aacaatatga cagctaccta taaccagcta tcgagatcaa catcaggccc cagacctggg 1500 

tgccagccca gctctccttg tgtgccgaag ctcagcaact cagacagcca ggacatccag 1560 

gggcggaatc gagacagcca gaatggggcc atagcccctg ggcacgtttc agtgattcgc 1620 

cctcctcaaa atctctaccc agaaaacatt gtgccctctt tctcccctgg gccttcaggg 1680 

agatacggag cxagtggtag gagtgaaggt ggcaggaatt cccgggtcac ccctgcagct 174 0 

gccaactacc gcagcagaac tgcaaaggca aagccttcca agcaacaggg agctggggat 18 00 

gcagaagagg aagaggagga gtaatggtgt ctccagagac tttaacaatc gggtcccggc 186 0 

ttctttgtga ccggaagaac cttgtcgctg ttgcagggcc acctctcttg gccctttggg 1920 

gggggagagn tttttccaaa gttataatat tttccaaagg tatggcccat tgtgggctct 1980 
ttaaggg 1987 



<210> 42 
<211> 2295 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 2228 

<22 3> a or g or c or t, unknown, or other 



<c220> 

<221> misc_f eature 

<223> Incyte Clone No. : 2830310 



<400> 42 

gcgggatagg 

tggcgcggag 

gcttccccca 

tcccgggaac 

cactcgagga 

ggaacagaca 

catacttcag 

gaactcagtt 

atctctctag 

accaaggtgt 

agatcctagg 

tttcagtgat 

atagacattg 

aaattaaaca 

gagcagagat 

caggaaggaa 

cagtgaaccc 

gattagtttt 

caaatacgca 

tgctactggc 

ttatgtgtgg 

gaaaattgaa 

gaagcaaaca 

cagtttgtca 

ctgtcattcc 



taaacccgcc 
ttcatgggcc 
acccagcctc 
cgagcgtcgg 
gtctgggcag 
tatttctagt 
tagtaggcat 
ccagtactgt 
ttacgtcatg 
gataaagcgg 
agacaaaaat 
gtcctataat 
ccggcggcca 
ttttgatgca 
caggccaagt 
acctgatggc 
agtggaattc 
actcttacag 
tctgttgtat 
agagatttcc 
tgactttaat 
ttatgaagga 
gaggtcctag 
tctgtttatt 
cgaagtgcca 



gcgcactggc 
gccggccagg 
cagaccagcg 

cggggaggcc 

agactggact 
tgtatgaggt 
ttttcactaa 
aactggagac 
aacgctgagg 
aattgggaat 
gttgatccca 
atactttcac 
gtattacact 
gacgtacttt 
ttggaatcac 
tgtgctattt 
ttccgccctg 
cccaaaattc 
aatccaaggc 
agtgttgccc 
tctgttcctg 
cttcccatag 
tgacagctga 
cacattactt 
taactgtgga 



gggacgctgg 
cttgtgccag 
ctggcctgct 
gcctctccgc 
acacgtggga 
ggcctggaca 
attggagacc 
ctgacaacct 
gagatgagcc 
atatatgtag 
aatgtgaaga 
aagatttact 
ggagttttag 
gtttgcaaga 
tgggttatca 
gcttcaaaca 
afcatttctct 
catatgctgc 
gaggtgatat 
accagaaaga 
gttctccact 
gaaagacagt 
aaaattgtct 
tcctgacact 
ttatattttc 



gccaacgagg 
tggctttgga 
ccacacactc 
aataccccat 
gaatctgcaa 
ttactctcga 
accctgtttg 
gagccagaca 
ttcatcaaaa 
ccatgataaa 
cagtgagaac 
ggaagataac 
gtttcccaat 
agttcaagaa 
ctgtgaatat 
ttccaaattt 
gttggacaga 
ctgccctgca 
taagctgacg 
tggcagcttc 
atatagtttc 
gatctgacac 
tcaaatttac 
ggaattccag 
tactctgcag 



tctggggtct 
tccctggcct 
cgcagcaccg 
gtttccccat 
aggtgttgct 
gctccttacc 
tttgagtcta 
tctttgattc 
cgaagaaaac 
gaaaaaacga 
aagtttgact 
tctcaccttt 
attctgaaag 
gatcattatg 
aagatgcgga 
tcactcttgt 
gacaatgttg 
atctgcgtag 
caattggcaa 
tgccctattg 
ataaaggaag 
aaacacagct 
agcaccattt 
aagtgaccac 
aaaaggaaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
12 00 
1260 
1320 
1380 
1440 
1500 
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tgttgctggg cacccaggag ctgaagttgc 
actgtcactt cttacagaac aagacttatg 
ctcttcagat catctgcctt tattggcaaa 
cacatactaa ttttctttcc aatttgtatt 
tatgcatgtt gtttattttt gcactgtgga 
aaactccata tcagaagaaa taactttata 
cgtgacaagt gagctctaaa ttctctttat 
aaatcctagt aaaattgaca gtgattttta 
taaaaaattt catttcataa ttttggcaag 
tcttgcaaac tacaagttgt ttcctttcca 
attgagttct aaatagttta tcttagaaag 
cattgtttag tttgtttttg aaggggtttg 
aatccacnta ttactgtcct gagccagtaa 
gaaatataag ccagc 



tttggttggt ggcttgaaac ttctagctag 156 0 
gactgttaat ggacttccaa acgaaaataa 162 0 
gttcagactt gagctctgac tctctttgat 1680 
gtttttcaaa gaatgtaaag ttcttaagtg 1740 
gattctgaag cggttatgtt agatgctttg 1800 
acaatttttt ttaataatga aaaatatttt 186 0 
tgtaaaagag atgtaaaggt tttatattct 192 0 
aatataatgc atcttccttt gtctgcttag 1980 
ctctgtagtrg gatccaaagt atctttgagt 2040 
gaaggcttga tttcattagg agacccctct 2100 
ccttgggtca ttcacaggta tccaaccagc 2160 
ataatgcttt ttaagttgta cagaatgctt 2220 
tatgcctgca ticgtggtggg gaatgtttgg 22 8 0 

2295 



<210> 43 

<211> 2819 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Clone No. : 2963346 

<400> 43 

taggacttca acatggcggc tgcggcactg 
gcgggcagag ttggaggtgg tggcgttcgc 
ggaccgagcc tgggaggccg gccggtgcca 
agcggcattc aggttctaaa tggcttctaa 
cagttacctt gatgatgtcc catttaagac 
tggtctacct attggcttct ccttgcctga 
tgacttctct ttggaaaaga aaaccattga 
agccgagcgg gaagcagagt gcaaaattgc 
tggcccagag ggcgatagca aaatgagctt 
acctcctatt aaccccatcc tcgccagctt 
ggtcagcagt agtgccacga aacagaaagt 
caatcttgct gactttgagt gtgaagaaga 
tgatgagaag gaagagctga gaaatattct 
gttattggac aataacttgc ccaggggagg 
cctggcatcc ttggaacggg caaccctaga 
tataacctta ccacagttgg gcaactgtga 
cccccctata cctgcagtaa gcaatatcaa 
tgacagcaat cagaagacag ccaagctggc 
caatggcacg ttccagaatt ccctaaagcc 
tgggcatcac actcttgggc tttcagcttt 
cctgacatcc tcccagatgc cttccctctc 
acctccaaat actggtccca cggtcacccc 
catgcccagc tgtccccagg cctattctga 
gtgtgtggag acggtggtca acatgggcta 
gaagaaagga gagaatattg agcagattct 
tgagaagggc ttcgaccctc ttttagtgga 
agaaaagatg atggagtttc ttcagttaat 
gaaagacatt aaggaagttt tgctattaca 
cctcatggct cgggcaggag ccagctgaga 
caccatccct gggaggccct gcagagccca 
gattttcttt tgggggttag aaggtcaggt 



gcggtggcta cggtgacggc ctggcccgga 6 0 
tctccctagg ggctgtcggg agctcagcgg 120 
gcacctttcg gcttctgaga cggcggcagc 180 
gaagttgggt gcagattttc atgggacttt 240 
aggagacaaa ttcaaaacac cagctaaagt 3 00 
ttgtttgcag gttgtcagag aagtacagta 360 
gtgggctgaa gagattaaga aaatcgaaga 42 0 
ggaagcagaa gctaaagtga attctaagag 480 
ctccaagact cacagtacag ccacaatgcc 540 
gcagcacaac agcatcctca caccaactcg 600 
tctcagccca cctcacataa aggcggattt 660 
cccatttgat aatctggagt taaaaactat 720 
ggtaggaacc actggaccca ttatggctca 780 
ctctgggtct gtgttacagg atgaggaggt 840 
tttcaagcct cttcataaac ccaatggctt 900 
aaagatgtca ctgtcttcca aagtgtccct 960 
atccctgtct ttccccaaac ttgactctga 1020 
gagcactttc catagcacat cctgcctccg 1080 
ttccacccaa agcagtgcca gtgagctcaa 1140 
gaacttggac agtggcacag agatgccagc 1200 
tgttttgtct gtgtgcacag aggaatcatc 1260 
tcctaatttc tcagtgtcac aagtgcccaa 1320 
actgcagatg ctgtccccca gcgagcggca 1380 
ctcgtacgag tgtgtcctca gagccatgaa 1440 
cgactatctc tttgcacatg gacagctttg 1500 
agaggctctg gaaatgcacc agtgttcaga 1560 
gagcaaattt aaggagatgg gctttgagct 1620 
caacaatgac caggacaatg ctttggaaga 1680 
ccaggccctg cctaggccct gccgcagaac 1740 
cctgtgggga aagagaaggg gcagcttccg 18 00 
gtggagactg ctcgccagtc tctgtgagcc 1860 
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taggccctga 
tggctccttc 
ccggagagac 
ttgctgagag 
aggtgcttcc 
gaaagattct 
tggttctgca 
gccttttctg 
ttggaccaca 
gaaatcattg 
ggccagcact 
aggatgacac 
tatgttgtac 
tttgctaact 
aaaaaaaaaa 
atgcgacatc 



gctggggagg 
cgtattaaac 
taccctagtc 
gggctgggga 
cccttcccct 
tccagggttt 
atatctctga 
ccaggcagtc 
gtcctctgct 
gccccttccc 
aactccacct 
aatgaataac 
atcattaaag 
aactattttg 
aaaaaaaaaa 
atagatcatc 



tggggaagat 
gcatttgcat 
tttctggggt 
gatggggcgg 
gagactggtg 
tattttttcc 
ggtgcaaaga 
accacgcttc 
acccagggtt 
agcacattga 
ctgttctcct 
acctagtcat 
atctaaatac 
attcttcaga 
gggcggcaga 
tatagtagtc 



tcgggcatgt 
tttgagaagt 
gtttatgtcc 
gagggccaga 
gactgaactc 
cctcctaaca 
atgcactttt 
cctaccccag 
ttagagcccc 
atgggtaagc 
tgaacagctt 
agaaatcagt 
aaaggatata 
gagaactact 
tctagaggaa 
acctaaattc 



gagtgccccc 
gtccttccca 
tcagctgaag 
ctcagtgctg 
cagtcaagtt 
aagtctcata 
ccctatgggg 
cctgtttctt 
tgctctagga 
agacaggcca 
cccctccagc 
ctctctggtt 
cagtcttgaa 
aataaaaatc 
ccaagcttaa 
aatttcagtg 



agaactgtcc 
cttcagccct 
cctggcctag 
ctgtggagct 
gagttcaagt 
gtgttaacac 
cccagagttt 
ttggcttggt 
aacagtttaa 
tgatttagtt 
ccactgcttt 
tgttttgtat 
tctaaaataa 
taaaaggtaa 
gtaagcgagc 
ccaacaaag 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2819 



<210> 44 

<211> 1459 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Clone No.: 2994234 



<400> 44 

cttcttgaga 

aggctggccg 

ttggagctga 

ttggaaagaa 

agactgtaag 

tgcagccatg 

aaagatgcga 

taaacagtcg 

cattaggaaa 

agaaattgcg 

ggaacgctgc 

ctggcagttg 

atctttcttt 

gttgggacta 

taatcctgtt 
aggtggctct 
tattttgtta 
tttcaccagc 
ttgcccaaga 
tcatgacctg 
tttagaagag 
gacgccaatg 
ttcctcaggc 
gtttgaagct 
tggcaccatt 



ccgaggaccg 
gagagagagg 
cgccgacggt 
tcccaagtgg 
atagaatgag 
ttgtggaagc 
tcacctccaa 
agcaaagaaa 
attcgtagat 
aatattttac 
ccggaagcaa 
gtctgcaaaa 
actattaaag 
aaaaatgtgg 
gagaagaata 
gaggccaaca 
aattctccca 
tttgaaattc 
agtatacaga 
aagcaattag 
agaatgcaag 
gttcttgaat 
tacagaataa 
aattttggca 
aaatgttga 



aaatcccggc 
gaaggacggg 
ttattgcagg 
atgaactgaa 
tccatttaag 
tgctgctgag 
aatacagacc 
atacaagaac 
taaaaggatg 
aagaactagg 
ttgtctgtag 
atgaggaaga 
accaagagaa 
tcattagcag 
agcaaatggt 
tgaaagtttg 
cagctataaa 
tccagcttct 
atagtatttc 
ttttgaaatg 
gattattgag 
taacaccaca 
aggatggaca 
aaattcaggc 



tccaggcctc 
gcctggcccc 
gaactgacaa 
tatctggatg 
tcccaggacg 
atcccagtcc 
tttcttagca 
agtggaaaag 
ggtactttta 
tgccgatgag 
tccaaccgct 
gttaatcaag 
ccagaagctg 
acttttgaca 
aagaattctc 
gctactaaaa 
ggaaacacta 
atccaaactc 
cttctctaaa 
tcctgccctt 
agaaggaatt 
gatagtacag 
tctagcaaat 
caaaaaaagt 



ggggactgcg 
cgggactccc 
gatcacattt 
aggacaagat 
gtggaaacta 
tgcaggctgt 
tgcttcacct 
ctctataaat 
gaggatgaaa 
actgctgtag 
gttaacaccc 
ttaatagagc 
aatgttcagt 
gctgcaccta 
caagagagtt 
ttgttaagcc 
gaatttctcc 
aaaggatttc 
aatgctttta 
ttatattatt 
tccatagctc 
tacaggataa 
ctaaatggat 
aaggecafcta 



gactgtgggg 
tgtgccttgc 
tgagaagaag 
ctgtggggag 
gctagtagat 
gttctttcag 
atacaactga 
gttcagttga 
cctatgttga 
ccagtatttt 
agagaaaact 
agtttccaga 
tctttcaaga 
atgtttttca 
atctagatgt 
aaaacccatt 
aggagcaagg 
tttttcaact 
aatgcacaga 
ctgttccagt 
agataagaga 
ggaaactgaa 
caaaaaaaga 
tttaaccctg 



60 
120 
180 
240 
300 
3 6 0' 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1459 



<210> 45 
<211> 2733 



49/66 



WO 99/64596 



PCT/US99/13281 



<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> 61, 63, 69, 92, 100, 151, 165, 179, 182, 198, 218 
<223> a or g or c or t, unknown, or other 



<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 4115958 



<400> 45 

ccggaaaggg ggaaaacctt tgggaaattt 
nangccccnt ttttggggag gaggacctta 
cccaagggac ccggccaaag attgtttttt 
cnccattgat cccaaccngc atcacatatc 
ccagttcggt tgcatctaaa gtgtttagga 
aaaataagga tgagtcctca gtttccattt 
tcagtgatgg gaaaaactta atacctaata 
aagaaaagtg ttctttgaaa catcaaacaa 
tcatccaaag tacgtatcag gagacacaga 
ttttgtctca gcacaaagaa aataacttag 
tgagcaatgg aattgataaa cagtattcaa 
caaataagta taaggaaaaa ataataaatg 
ttcagtctga tgattccaaa aaaaatgact 
ctgaatattt gaacaaatat gaaagcatgt 
catgtgagca aaatactcaa cttaatagca 
gtttcaaatg taaaaagtca gaccaattaa 
ctaatgaaaa atcccaaaac tattctcaga 
gtgagcaacc aaaaaataca gaggtattga 
gaaaacataa ttttcattcc ttggatggaa 
gattgcctct tctttcaagc cctgttcctc 
tggattcttc attactacat ctgaaaagct 
tcgtttcgat cctctcggca ttgttgatgg 
tttctgcccc acacatttga ctcttccagt 
caatgctccc tctccttata tgggtgtgat 
tcgagtacct ccttcaggaa caatacaagt 
aatcattctc tcacatacat atagaactta 
actttgacct tatttaatcc taataagact 
ttacgagata tgccagccaa tcatcagaca 
gttaaacaag aagtgaagag aagtgttaat 
ttattacgct acctcataca tctgaggttc 
catagagacg tacggctcct gttctctaga 
tatgaactca aatcttacac tgaatcacca 
taaggagtga tgatttaaag tatttactca 
agaatggagt gtaccaaatt aacaatcagg 
agtttttatg aaaggattcc tgaaatgaaa 
catgtcctaa tcctttttgt actgttgaaa 
acccccagtt agattagtgt ttacacattt 
taatactcaa gtgatgtttg tctctagtgt 
atactatgta tttttgaeat taatgttgaa 
tgtaatgtgt taagtgctgt tcatttaagt 
ctttaaaagt aacacactat gcaagcatgt 
tataacacca tatccaggtt gctatctcac 
gtgcaatgtg gactaagctg cttcactttc 
catagccagg attccttatc cccaaaacag 



cccaaaaggg ccttttttaa atccccccaa 60 
ancccctttn gaataggggc tcttttaacc 120 
nccaagggat ttgcnccagc catttgggng 18 0 
cccccggncg gtcattcaac atgcatgact 240 
gttcatatga agacaaaaat ttgttgaaga 300 
ctcacacaaa atgttccttg ttaggagaca 360 
aatgttttac ttcttttaaa aataatagta 420 
gaaatcagtg tcagaacaat cctagtgaaa 480 
acaaaagttc tagtttatca acttcctcaa 540 
atttgacaag cagattcaag gagcaagaaa 600 
attgcaccac tattgacaaa cagatttgta 660 
agaactataa tccaaaattc tttggcaatc 720 
caaaaataaa agttactgtg ttggaaatgt 780 
cctcaaataa agactcaaaa aggcctaaga 840 
tagagaatta tctcaataaa gataatgaag 900 
aaaatgaaca agataagcaa gaagatccaa 96 0 
gaagaagtat aaaagactgt ttgtctacat 1020 
ggactacact gaaacattca aatgtgtggc 1080 
cttcaaccag agcctttcat cctcaaactg 1140 
aaagaaaaac acaatcaggt tgctttgatc 12 00 
tctcatctag aaggaatctg tcttgaacta 1260 
ttttactgcc gaggtagggg caagtggtgc 132 0 
tgaagtgtca ttctacagtg tttcagatga 1380 
tactttagag tcccttggta aaaggggtta 1440 
ggtatgtgtt ttatagccta caattgcaat 1500 
gccctttctc ctgctactgc tgctgcaaca 1560 
gtggtgaaga tgtttgttgt gatatatgat 1620 
ttcctacgac aaagaacttt ttctgtacct 1680 
aaagagaaca tccgacacac agaagaacgg 174 0 
cagagttcta aatctggaaa gatctacctc 1800 
aagtcaatgg aggttgatag cggtgctgca 186 0 
acaaaccctc agttttcacc aagatgttga 1920 
gtacccaagt ttgcaagtaa aaattagcat 1980 
agagtggatt ctctcctgtt atcctggacc 2 040 
tccatatatt ccatgtagac tggaaaaact 2100 
ccacttcatt ggacatgttg caatagcaaa 2160 
tctcagttat ttaatattta atgttttcct 2220 
tctaatgtag cacaaatcct atgtaaaatc 2280 
atcaaatata tgcacaagtc tttaattttg 2 34 0 
tattgaaaat gagaataaaa tgttgagctt 2400 
gtacttttta tatctctcat gtttagtttt 2460 
atagtagtcc tttaacatat tgtattagca 2520 
cctttgcaag ttcagatcat catgcccatt 2580 
ttctattttt ccttaatcac tactatagag 2640 



50/66 



WO 99764596 
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tctttacatt aaattactgt cgtatgctag ataattttct caaattgtta aaagaatatg 2700 
tactttggaa acaaattagt atttatattg tga 2733 



<210> 46 

<211> 2177 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte Clone No. 



779255 



<400> 46 

attgctccaa 

gctaagccct 

cagcacaccc 

cactgctcca 

ggggccaagc 

cagcagcaca 

gagaaagcct 

aaacccttca 

cacctgtcta 

gcatacacat 

cagcaacagg 

caagcccagg 

cagcagcagc 

ttccagtctc 

ccaccccagt 

tgcctcactg 

gtgctgccac 

ggtgggaaaa 

acggctttcc 
ccctcctgat 
gcaaagcttc 
accttgtcca 
gatgtaagac 
cagcctcttg 
tgagagccat 
gagatttggg 
tccccaaggg 
ttttacggat 
tctccccaac 
acaggggtgt 
taggtgccac 
ttatacccac 
tctctgagaa 
acccctgtcc 
ggcagatttc 
atttaaaaat 
attgtataga 



acgacgccaa 
agagttgcca 
ggatccactc 
agaccttcgc 
cc£aca.a.ctg 
cacgaatcca 
tcacacaact 
agtgccacaa 
cgcacacagt 
cagaaacata 
tgcaggcagc 
ctcaggctca 
agcagcagca 
ctggggcagc 
gttcctttga 
tcaccaccag 
ccactgggaa 
gtccttttct 
ttcacaggat 
tgcaaaggaa 
taaaactggg 
gtgtggccac 
tagaggctgg 
ttctattctt 
ttcttcccct 
gtaggggcta 
gccagaaatt 
ttccttgaag 
ccacccccaa 
tgcggagaag 
agtcagatgc 
tcatcacggc 
ttttcatatt 
tctagccaca 
agaggaaagg 
gaagaaaaaa 
aatagag 



cttctactac 
cttctgtgaa 
caagatgcac 
caacacctcc 
ttcctactgc 
cactggtgat 
ctccaatctg 
ctgtcatcgg 
gaagcatgcc 
ccttatgaaa 
agcagcagcg 
ggctcaagcc 
gcagcagcag 
cccccagggt 
cctgaccccg 
caccatccag 
gaggaagaag 
tccttgacag 
accatccttt 
tactgagctg 
ggtcggtgct 
ttaccctatc 
ccctcccaga 
gcctgcaaaa 
acatcgtctc 
gacctccttt 
cccaagttgg 
cctctctcct 
ggagctgtgg 
ggagagttca 
cggacacgga 
catctttgtt 
tgatgttttg 
gcatttttct 
aaattaaaaa 
agaaaaaaac 



ctggcccttc 
aaatccttcc 
acggagacca 
tacctggccc 
cagaaggcct 
agaccataca 
cagtcccaca 
gcgtacacgg 
aaggtgtaca 
catatgcgca 
gcagcagtgg 
caggcccagg 
cagcagcagc 

gggggtggtg 

tataagacgg 
gtggagcacc 
tagtcctggt 
gccttggctc 
ttctgaactc 
atggtgtcat 
caagggaagg 
cttaccct.cc 
taacagagaa 
gaacagaggt 
actttgcttc 
tatttgaagg 
tcacaggtgg 
tctctgccta 
gaggctttgt 
aggcaaacgc 
tttatatata 
gtaaccattt 
tttatttttg 
ttttgtcttt 
aaaaatcagg 
ttatgtacaa 



acatccgtat 
gccagctctc 
tcaagcccca 
agcacctccg 
tccgccagct 
aatgtgcaca 
gacggcaaca 
atgcagcctc 
cctgcactat 
aacacaaccc 
cccaggccca 
cccaggcctc 
agcaacagcc 
gggacagcaa 
cggagcatca 
tggccagctc 
gtcttctttc 
catctccttg 
ttcttcaaaa 
ccagcagcct 
atttgctatg 
ttatcctcaa 
aagggagccc 
ttctcaaatg 
ctattgactg 
gggcaagggc 
cttagaagtg 
caaagaccct 
gttatctgtg 
aaggactgga 
aatatatata 
ctgtgtttat 
tccttttttt 
tttttttttt 
aaaccagttg 
accaaggggt 



acactcaggg 
ccaccttcag 
caagtgcccg 
tatccactcg 
ctcccacctt 
cccaggctgt 
caacaaagat 
actagaggtg 
ctgcagtcgg 
gcctgatctt 
ggctcaagct 
ccaggcatca 
accaccacac 
tcccaaccct 
taaggacatc 
ttagagatcc 
tccaactctt 
ggcctctgtc 
ggaacatcag 
cccctcccaa 
acctcataga 
agtttgggct 
caaatgcaac 
cctcagtccc 
ctggtagaag 
tgagatgtgg 
tgtgttatgg 
atactctcag 
aaactccaaa 
cttagctccc 
tataaatata 
aaatgcatta 
ccctctctcc 
tttaaatcat 
ttataaagta 
tttttagaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2177 



<210> 47 

<211> 2685 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_ feature 

<223> Incyte Clone No. ; 1303605 



<400> 47 

gctctgtttg 

atgaactctt 

ggcatatctt 

cagagcaggc 

agaacactgc 

tgaacagcat 

accaagtttt 

gggatagcga 

aagatgactt 

caggaaagaa 

ttaaaatgat 

cattctacag 

agggccactt 

ttgatattaa 

ctggcgttgt 

cctcagaaag 

tgctgcagga 

aggagatcaa 

cccagctaca 

tgccaacgca 

acttggcatg 

aagtaattat 

ataacacagc 

atgggaccag 

atctagtaaa 

ttgggtacct 

ctactggcag 

tggagctcca 

tttctgtctt 

ctgtgacttt 

cagctcactc 

agcctttcga 

actccccggg 

ctccttggga 

tgagggtagt 

tgctttgctt 

ccfctttctgt 

taactttggg 

ttcttaaaca 

ttctgggatg 

tagaaaccta 

gtagcagtaa 

ctctactttt 

ttcacttgta 

gtcttttata 



acttcatgca 
ctaaggctac 
tttcttgcac 
agtggaactc 
caagtataac 
tgttgcaaaa 
gcagattgtg 
agggcagaaa 
tagcccaatt 
tgtatatttg 
cagcaccagg 
gtgtgacaca 
ggcatctctg 
aagaacatca 
ggacctcgtt 
cagcatgaac 
gaagctacgc 
ctccaccttc 
gtcatgtccc 
caccagtctt 
tttccatgaa 
tccaacaccc 
tactcctcac 
gaggagctct 
ggaataggta 
tttaggaaat 
cagattttgt 
accattttta 
tataacttga 
taaaaagttg 
aatgtgggtg 
ttataaaata 
ccttctgagc 
agggaaagcg 
taactcatca 
gaactgttca 
ttttgggggg 
ttgtcccctc 
tgttttccac 
tcgggagacc 
cactgtcact 
tgtgtttatg 
ttatggcaca 
aaaaaataat 
ttcgggataa 



gaatttttaa 
tgtaactgaa 
atcaaggagg 
agtgccctcc 
tatgaggagc 
cataaggagt 
tcggcaatgg 
ctgctcattg 
aataggatag 
acggtcacca 
gcggccagcg 
gtgaccagcg 
tttctgaatg 
aaggaggtgt 
tcaagaagca 
tgcagcagct 
aagctgaagg 
tgtccctgtg 
gtctgcaggt 
ctcaatctga 
ctgcactatt 
atctgccatg 
tgcaaaaaca 
gggacgcaga 
aagtctttga 
gatgttgtaa 
aagaattact 
ataggaaagt 
tcaagaatga 
acaatgttgt 
gctccctatt 
ctaccaatct 
tggtggaata 
gagcttgctg 
cttctcccaa 
agccttttat 
ggagttttgt 
tgtatgtttc 
tcccacttgg 
tagatgacct 
ttactgagat 
catttgtttc 
tattagcata 
gtttccacgt 
taaagacttt 



agtttttagg 
ttccaaccca 
ccctcttggc 
tggcccagac 
tctgtgccaa 
tggaggggac 
aaaactatgg 
gggttggacc 
cttatcctgt 
aggaatctgg 
ggctctaccg 
ccgtgatgat 
aaaacattaa 
atgaccatgc 
accagagccc 
gcgagggcct 
aagccatgct 
gccacactgt 
cgcgtgtgga 
ctgtaatcta 
ataaactatc 
cgatgttaaa 
tatccatgcg 
cacattcctt 
tgtcagtgaa 
gtctccttaa 
tttaagaatt 
cttttgtaaa 
ttggaaggca 
cagatttaaa 
cctttacgct 
tgttataaga 
ttttatttca 
agtgagagat 
gcactcgatc 
agccttacca 
tgtgtttttt 
gaaggggttt 
gcattttgga 
tatcgggtgc 
ttctgagtat 
tttgcacaga 
taagccttta 
aaagaactct 
aaagcaaaaa 



tttcttgaaa 
cagatatgtt 
aggccacctc 
caagtttgga 
ggagctctcc 
cagccaggct 
catagaatgg 
tgaaggaatc 
ggtgcagatg 
gaacagcatc 
agcgataaca 
gcagtatagc 
ccttggcaag 
caggagggct 
ttcacactcg 
cagctgccag 
gtgcatggtg 
gtgctgtgag 
gcatgtccag 
atctgttgtg 
aaaatgatag 
aaaaaaaaaa 
tagaatcaac 
ggatgttgat 
gtggcaacat 
tgtatcctga 
tcattctttt 
ttgttgtcgt 
aacaggttta 
ccagtgtggc 
ccccctatcc 
ttactgtgga 
gactgaaaac 
ggagcctcat 
ccagcttcac 
taagtattta 
taaagtaagt 
tggttctttt 
agctggtcag 
aatactagct 
acttttcata 
cattttgtca 
ttccaagagg 
gttatatcct 
aaaaa 



atgtaatttc 
acacacgatt 
ttgtgttccc 
gactacaacc 
tctgccacct 
tcagctgaat 
cattctgtgc 
tcaatttgta 
gccacccagt 
gtgctcttgt 
gagacgcacg 
cgtgacttga 
aaatatgtct 
ctgtacaatg 
cctctgaagt 
cagacccggg 
tgctgcgagg 
agctgcgccg 
cacgtctatc 
cttttgtztgg 
attgtggaga 
aggaagaaaa 
aactccagtc 
tttttttatg 
agccaaaaag 
ggtaagtttc 
tgtatggtca 
tttaatgtca 
caaatcaatt 
tagtaaaaag 
ctaccccaca 
gtagtcaagt 
agagagcact 
ggtgtacaac 
ccactggtgt 
gatatggtgt 
gcttaagtat 
tgcttctgtt 
ctagcaggtt 
aaggtaaagc 
ttgccttaat 
aatattaaaa 
tatttatttt 
agaggactct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2685 



<210> 48 
<211> 2408 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 1611167 

<400> 48 

ggctgcatcc cggccggcgg ttccggagcc tcgcgtctgg ggaggcgcgg gccggcacgc 6 0 

tcgagcttag ggagatctgc cttctggaga ctgcgccgtc ctcccgggag agccagaaag 12 0 

aggacatggc tgctgggcag cgggaagcga ggccccaggt gtcactgacg tttgaggatg 180 

tggctgtgct gtttacccga gatgagtgga gaaagctggc cccttctcag agaaacttgt 24 0 

accgggatgt gatgctggag aactatagga acctggtctc actggggctc ccatttacca 300 

aaccaaaagt gatctccctg ttgcagcaag gagaagatcc ctgggaggtg gagaaagacg 360 

gttctggcgt ctcctctcta ggatcgaaga gcagtcataa aaccacaaag tcaacgcaaa 420 

cacaagactc ttcatttcag ggactgatac tgaaaagatc caacaggaat gtaccttggg 480 

atttgaaatt agaaaagcct tacatatatg aaggcagatt agagaaaaag caggataaaa 54 0 

agggaagttt tcagatagtt tcagccaccc acaaaaaaat ccccactata gaaagaagcc 600 

ataaaaatac tgaattgagc caaaacttca gcccaaagtc agtgcttatt aggcaacaga 660 

tacttcccag agaaaaaaca ccaccaaaat gtgaaataca aggaaacagc ctcaaacaga 720 

attcacaatt acttaatcaa ccaaaaatta cagcagataa acgctataaa tgtagtctgt 780 

gtgaaaaaac cttcattaac acttcatccc ttcgtaaaca tgagaaaaac catagtggag 840 

agaaactatt taagtgtaaa gaatgttcaa aagcctttag ccaaagttca gctcttattc 900 

aacatcaaat aacacatact ggagagaaac cctacatatg taaagaatgt gggaaagcct 960 

ttactctcag tacatccctt tataaacatc taagaaccca tactgtggag aaatcctaca 1020 

gatgtaaaga atgtggtaaa tccttcagcc gaaggtcagg cctttttata catcaaaaaa 1080 

ttcatgctga agaaaaccct tgtaagtata atccgggtag gaaggcatct agttgcagca 1140 

catccctttc tggatgtcaa agaattcatt ctagaaagaa gtcctactta tgtaatgaat 1200 

gtggcaacac ctttaagtct agctcatccc ttcgttatca tcagagaatt cacactggag 1260 

agaagccttt taaatgtagt gaatgtggga gagccttcag ccagagtgcc tctcttattc 132 0 

aacatgaaag aattcacacc ggagaaaagc cctatagatg caatgaatgt gggaaaggct 13 8 0 

ttacttctat ttcacgactt aatagacacc gaatcattca tactggagag aagttttata 1440 

attgtaatga atgtggtaaa gccttaagct cccactcaac acttattatt cacgagcgaa 1500 

ttcatactgg agaaaaacca tgtaaatgta aagtatgtgg aaaagccttc agacagagtt 156 0 

cagctctcat tcaacatcag agaatgcata ctggagaaag accctataaa tgtaacgagt 1620 

gtgggaaaac attcaggtgt aactcatcac ttagtaatca ccagagaatt catactggag 1680 

agaaaccata tcgatgtgag gaatgtggga tatcttttgg ccaaagttca gctcttattc 1740 

agcatcgaag gattcataca ggagaaaaac cctttaaatg taatacatgt ggaaaaactt 1800 

ttagacaaag ctcatcacgt attgcacatc agagaattca tactggagag aaaccctatg 186 0 

aatgtaatac atgtgggaaa cttttcaacc ataggtcatc ccttactaat cattataaaa 1920 

ttcatatcga agaggacccc tagaaagtag atttgtatgt gtgaaagcct taaaccaaag 198 0 

ctcatcgaag aatacatcct tgagagagat gtaataaatg taatggatgt gaaaaaaact 2 04 0 

gtaataattt agccctcatt aggtatttaa ttccatggat aaacctcagc tatataatag 2100 

ataatgagga aagtgtttgt gcctgtcaga cacttaaaaa aataacctga gatgaagaat 216 0 

ttacaattga agacattgac tttagccatt tgtgaaatgg gtttgctttt tcccttttcc 2220 

tacagacgta tatgctagat gtcacgtgat catcagaaac agatatccga gtgggtgggg 228 0 

aggtgtgctt tagatttctc attagaagac caccaaactg gtaatatttt tatagcattt 234 0 

taatagcata caaatgaatt gatcaaattg taccttttta gagaaaagga ccaaaataaa 24 0 0 

agaaaaat 24 0 8 



<2l0> 49 

<211> 2990 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte Clone No.: 1907472 
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<400> 49 

gtacaacttt 

tccaaatggc 

ttccttttgt 

caaaacttgc 

tcaagaggag 

tagtgaaggc 

tgttacttgc 

tgttgtgaga 

ggctgcaagg 

tggggaaatg 

tgagcccaac 

gacatctcga 

gatgaagaac 

acctcacatg 

agtgcctatc 

tcctgttcct 

ggacagcagt 

tgctcttgat 

agagacaacc 

gaagaactct 

ggatatcgaa 

attaccacct 

gggagccaag 

agaatgcagc 

aactaggcaa 

aaagttaaaa 

ttttgtcaat 

cctttgcctt 

tgatcctgga 

aagcatactt 

aaaacaacta 

taagtatttt 

taggcattgg 

gtcttccttg 

agatgatgag 

gaaaatgttt 

tttttatttg 

ttcactggac 

ttatgataaa 

gcaatcggga 

ttcaagttta 

tgtaaataat 

caaatacatt 

ttacattaca 

catccatatt 

gttcttaacc 

tttcggaatt 

gttttgtttt 

ataccatctt 

gtgaatctat 



tgcaattcca 
cagtttgtag 
tcaaaaccag 
tcagatgact 
aagactcttc 
tgcaaattat 
aactattgtt 
gacttctgca 
cgtgactgtt 
aaacatttct 
atgacaactc 
accaaaatga 
aaagcagttc 
cagaccaaat 
cctgtgcctg 
accacagttc 
gagaagattc 
acagagttgc 
aacatcaaca 
gacccagaga 
atagattttc 
gtttttggcg 
agaaaggctg 
tttcctttca 
cttgatgaag 
gaggatctac 
gagatccgac 
ggaatacagg 
taccaaacat 
ccagatgggt 
ggatcacact 
ggcctgaaaa 
aaaaaaaatc 
tgtggaacag 
ccagtatttg 
gaatgctact 
caaccagaat 
cgaaacacct 
gacaattatg 
taaaacagca 
ctccttctgt 
ctgtgagtga 
caaattcttt 
gaatatgaat 
gcacaactct 
agatggttgt 
tttaaaataa 
gtttggtttt 
ctgtaccaaa 
gctggtgaaa 



gttgtgtggc 
cgccaagtga 
aaatcctgga 
ataagaagtt 
atgaaacagt 
tatacaaaca 
ctcagctatg 
gtgaagattg 
gtaaatctca 
gtgatcaaca 
agaaaggacc 
caggttcagc 
tttgcaaacc 
cttgtcagac 
tgtatatccc 
ct.gttcctgt 
ctgcagcaat 
ttacaatgac 
gtgtaattat 
cacagtccag 
ccagagctgc 
aagaatatga 
tatcaggata 
aatatacgta 
atcttctggt 
tctctcacac 
ggccaaatgg 
agtatttgtg 
ttgagcaaga 
caatattctc 
ctccagtagc 
cagtggaaca 
ctttaacgat 
ataatgaaga 
aacaaattga 
tgtctaaaag 
gctctagttc 
tggaaaatat 
aactggatga 
ttagatagtc 
tttgagtttt 
aagttgccat 
ttttattatt 
gagaatgtgc 
actgttgcaa 
gtatgggtag 
actgtaaact 
acattttagt 
aatagacaag 
acaatttttt 



taaatttcag 
tattcagttg 
atgggagaac 
gcattgcata 
aaatttctct 
ggattttgcc 
taagaaggga 
ctgtaaaaaa 
aggaactctt 
ttgcttactg 
tgaaaactta 
accaccccct 
tttaacaatg 
agatgatact 
agttcctatg 
gccagttcct 
tgaggagcta 
ggatatgatg 
tgaaacagat 
catgcctgat 
tgaggagctt 
ggaacagccc 
ccagtctcat 
tggcgtaaat 
attagatgag 
cacagctgag 
agagaattat 
tggaagtaat 
attgaataaa 
tcgagttgaa 
tcttctgaat 
acacttaaga 
ggaaaacaaa 
taaaattact 
aaacacagcc 
tccacagaat 
tacagatagc 
gcttgtacgg 
agacacagac 
atgctgctag 
gtagcagtgt 
tattctatgt 
atttatttga 
catgtataat 
agcttccttg 
cactactaaa 
aataggctgg 
tactgaagcc 
agaatgctgt 
ttccccttat 



gctctaagta 
aaatgcaact 
aaagtgcatc 
gttacatatt 
ggcgttaaga 
agacgtttag 
gcaactaaag 
tttcaggatt 
aaagagcgag 
cgtttctact 
cattatgatc 
tctccaacac 
acaaaagcta 
tggaggacag 
cacatgtaca 
gtttttctgc 
aaaagcaagg 
agtgaagacg 
ataattggtt 
gtaccatatg 
gatatggaaa 
agacctcgat 
gatgatagtt 
gcatggaaac 
ttaaaatctt 
cttaactatg 
gcacctgaca 
cgaaaagaca 
atactgcgaa 
gaagactatc 
acactgttct 
ctttcctttg 
gcgtgtcttc 
actggaaaaa 
aatccttcca 
cttaatcaga 
cctgtctggt 
gttcttctag 
taaaaaggaa 
atctttatta 
acccacgctg 
agtggtttta 
ttaggtatgt 
ttttttcttg 
gaagggggct 
agtttagaac 
ggtttttgtt 
ttacaaggtt 
caatattggt 
taaaacctta 



tgcagtcatc 
actgcaaaaa 
agttctgcag 
gcgaatactg 
gacctttctg 
gattgagatg 
aactcgatgg 
ggtactacaa 
ttcagtggcg 
gtcaacaaaa 
agggttgtca 
ctaacaaaga 
cttactgtaa 
aatatgttcc 
gtcagaatat 
ctgctccatt 
tttcttcaga 
aggggaaaac 
cagacctttt 
aaccagattt 
atgaattttt 
ctaaaaaaaa 
ctgacaattc 
actgggtcaa 
ctaaatcagt 
ggttagctca 
gcatctatta 
acatatttat 
gctggcaacc 
tctggaggat 
actttaacac 
gcactgtgtt 
gataccaagt 
gaaaacatga 
gatgtcctgt 
ggatggatgt 
atacgtctac 
taaaagatat 
cgttgcagaa 
tggaaaacat 
ggtattacca 
ggatacttaa 
ttgtaacttt 
tagtaagaaa 
cttttactgg 
ttgcagtgtc 
ttgttttggg 
atgtagagag 
gtactgtaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2990 



<210> 50 

<211> 771 

<212> DNA 

<213 > Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte Clone No.: 1985458 



<400> 50 

cccctactaa agccttgggg ttagtacgcg 
tgccaatggt ggagaagaaa acttcggttc 
tggaccgggc tgcccggcag cgtcgcatca 
acttccagga tgacccccac gcgggactcc 
atgacgatgc ggacactgga aagaaaaaga 
gcttccgaaa aaactttcag gccctgttgg 
ctaactacct gacggcctgt gcgggacccc 
tctgtggctt cccatccccc tacacctgtg 
gctgtctggg gacccaccag gagaccaggt 
tcccagagag gaagggccgc tgtgcactgc 
ccaatgcagg gggagctctt cctggaccaa 
tgtgtcttat ctgccaggaa agaccagcct 
gggcccccca gtgctgttag aataaaaagc 



tgcgcacagt ttcttccgac agttgtgttg 6 0 

gctcccagga ccccgggcag cggcgggtgc 12 0 

accggcagct ggaggccctg gagaatgaca 18 0 

ctcagctcgg caagagactg cctcagtttg 240 

agaaaacccg aggtgatcat tttaaacttc 300 

aggagcagaa cttgagtgtg gccgagggcc 360 

catcgcggcc ccagcgcccc ttctgtgctg 420 

tcagctgcgg tgcccggtac tgcactgtgc 4 8 0 

gtctgaagtg gactgtgtga gcctgggcat 54 0 

ccggccttca gaaagacaga atttcatcac 6 00 

gggaggagcc gctcattcac ccaacaaaac 66 0 

cactcctggg aactgtctgg caggtaggct 72 0 

ctcgtgccgg aaaaaaaaaa a 771 



<210> 51 

<211> 2076 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 
<222> 1957 

<22 3> a or g or c or t, unknown, or other 



<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 2726431 



<400> 51 

cccgcctgcc cctagcggtc cctggcgtcc 
gcgaagaaat cgccccggga catggactca 
accctggagg agtgggcttt gctggattct 
ctggagacct ttcagaacct ggcctcagta 
gatcaaacca caaaccaggg gagaaatctc 
gaaagtaatg atcaatgtgg agaagccctc 
aaaattccac ctggagtaaa acagtatgaa 
cgccgcacat ccctcaagag tcccatcaca 
caggaatgtg ggcaggccta cagttgtcgt 
aatggagaga gaccctatgt gtgtaaatta 
ctcaatcggc atgtaaggat tcacactgct 
aaagccttta ttgacttctc aagtcttact 
ccatataagt gtaaggaatg tgggaaagct 
acaataacac acactggcga gaagccatat 
tattcctcaa cttttcgaag acatatgatt 
aaagaatgtg gggaggcctt cagttattct 
actggagaga aaccctacga atgcaaacaa 
tttcgaagac atgaaaggat tcacactgga 
aagaccttca tttatcccca gtcctttcga 
ccctatgaat gcaaccagtg cgggaaagca 
atgagggtgc acactggaga gaaaccctat 
tggcccatat ctttacgaaa acatatgaga 



cggttcctgt cgcgctcacc tgcgccggta 60 
gtggtctttg aggatgtggc tgtggacttc 12 0 
gctcagaggg acctctacag agatgtgatg 18 0 
ggaaaaattt gggacagtct tagcatcgaa 24 0 
agtagaaatc atgggttgga gagactctgt 3 00 
agccagattc cacatcttaa tctgtacaag 36 0 
tacaacacgt acggaaaagt cttcatgcat 420 
gttcacactg gacacaaacc atatcagtgc 480 
tcacacctaa gaatgcatgt gagaacccac 540 
tgtgggaaaa cctttcctcg tacttcctcc 6 00 
gagaaaactt acgaatgtaa gcaatgtggg 660 
agtcatctca gaagtcacac cggagagaag 720 
ttcagttatt cctcaacgtt tcgaagacac 780 
aaatgtaagg aatgtgcgga agcctttagt 84 0 
tcacacactg gagagaagcc acataaatgt 900 
tcggcttttc gaagacacat gataacacac 96 0 
tgtgggaaaa ccttcattta tctccagtcc 1020 
gagaaaccct acgaatgcaa acagtgtggg 108 0 
agacatgaaa ggactcatgg tggagagaaa 114 0 
ttcagtcacc cctcctcctt tcgaggacac 1200 
gagtgcaagc aatgtgggaa aactttcaat 12 6 0 
acacatacta gagagaaacc ctatgaatgt 1320 
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aagcagtgtg ggaaagcctt cagcttgtct gcttgctttc gagaacatgt gagaatgcac 1380 
cctgaagaca aatcctatga atgcaagcta tgtgggaaag ctttctattg ccacatatcc 1440 
ttacaaaaac atatgagaag gcataccgca gagaaactct ataaatgcaa gcagtgtggg 1500 
aaagctttca gttggcctga acttttgcaa caacatgtga gaacgcacac tgtagagaag 1560 
ccctatgaat gtaaggaatg tgggaaggtc ttcaaatggc catcatcttt accaatacat 1620 
atgagactgc acactggaga gaaaccttat caatgtaagc attgtgggaa agcattcaat 1680 
tgttcctcat ccttaaggcg acatgtgaga atacacacta cagaaaaaca gtataagtgt 174 0 
aatgtaggac atcctcctgc aaatgaattc atgtgcagtg cttcagaaaa gtcacaccag 1800 
gagagagatc tgatcaaagt tgtaaatatg gtgttgcctt tatgagttcc ttatcctgaa 1860 
agtggacact caaggagtgt gtctgtagtt catttgcata gaaactatag cgaagaggcc 1920 
cgcaccgatc gcccttccca acagttgcgc agctganatg gcgaatggga cgcgccctgt 1980 
agcggcgcat taagcgcggc gggtgtggtg gttacgcgca gcgaccgcta cacttgccag 204 0 
cgccctagcg cccgctcctt tcgctttctt cccttc 2076 



<210> 52 

<211> 1197 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte Clone No. 



2743828 



<400> 52 

cccgggtccc 

gcggcggcgg 

gggaggcccc 

atgagcccga 

catcacgaag 

aaaagaaatg 

gacgccagga 

ccaaaaactc 

aaaaagccca 

aatcgcaaac 

ctgcagtctg 

atgaagattg 

cggggtgact 

cgggaggctc 
ctgagcaaaa 
aatcacagca 
ctcatcctca 
gaccgcagca 
gtgccctccc 
attttttttt 



cttctccagg 
cggtggcagc 
gcaccgacgg 
acaaatgctc 
tcaaatgcca 
ctgggaacgc 
aaggtcccct 
caccctcatc 
tcaagggcaa 
ttacggattt 
aagaaaggaa 
acctcatcga 
ttgtggtgga 
tgtacgcaca 
cctactgcgt 
aatgtgggaa 
tcgcctcccg 
aggcttccat 
cgccccactt 
tacacactta 



caggaagatg 
gacggccccg 
ggagaacgta 
tggaatgcgt 
ggggaaacca 
agtacggagc 
ggtacctttt 
ttgtgattcc 
acaggccccc 
ctaccctgtc 
aagaatagat 
tggcaaaggc 
ataccacggg 
ggacccttcc 
ggatgcaact 
ctgccaaacc 
agacatcgcg 
tgaagcccac 
tcccttcttc 
atcttagcgg 



tccaagcccc 
ggcccggaga 
tttaccgggc 
ttcccccttc 
ttagccggaa 
gccatgaagt 
ccaaaccaaa 
accaatgcag 
cgaaaaaaag 
cgaaggagct 
gaattgattg 
aggggtgtga 
gacctcatcg 
acgggctgct 
agagagacaa 
aaactgcacg 
gctggggagg 
ccgtggctga 
aaaggacaaa 
attacttcag 



gcgcggtgga 
tggtggagcg 
agtcaaagat 
aggaagagaa 
tctacaggaa 
ccgaggaaca 
aatctgaagc 
ccatcgccaa 
ctcaaggaaa 
ccaggaagag 
aaagtgggaa 
ttgccaccaa 
agatcaccga 
acatgtacta 
atcgcctagg 
acatcgacgg 
agctcctgta 
agcattaacc 
gtgccctcaa 
atgtttttaa 



ggcggcggcg 
gaggggcccg 
ctattcctac 
ctcagttaca 
acgagaagag 
gaagatcaaa 
agcagaacct 
gcaagccctg 
aacgcaacag 
caaagccgag 
ggaagaagga 
gcagttct.cc 
cgccaagaaa 
ttttcagtat 
aagactgatc 
cgtacctcac 
tgactatggg 
ggtgggcccc 
agggaattga 
aaagtat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1197 



<210> 53 

<211> 2843 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<223> Incyte Clone No. : 2998209 

<400> 53 
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tcagatgtac 
ctcaggatct 
tagaaataaa 
ctgggccaag 
aatagaattg 
atggagggga 
ggactctgaa 

ggaggaactt 

tgattggctc 
agcatacaag 
gctgaatcgt 
tcagaggcag 
tattccagca 
tcttcatcag 
attgccagca 
gcttgataga 
ggaattggct 
ctgcctactt 
tcggaaagcc 
agattgggaa 
gtggcaagaa 
gcagacgcac 
aaaatattac 
tctgcctttg 
gaggggccac 
tggtgctaat 
tgaaaggaag 
gcaagaaaaa 
tgccatgcag 
atggcttgac 
tgaggatgat 
ctgtaatgac 
aaatctggca 
aatgcctctg 
ctaattttat 
acgcatgtgg 
attgagaaaa 
ttcctttcag 
ctaagtctca 
tttattttat 
caagaaaagc 
agagggtttt 
tttagatatt 
ggcagtggtc 
aaatgaattg 
gccatgccag 
tagaagggca 
tttaaaaatg 



ccagtgctta 
ggctcaagac 
ctgaacacta 
gctaaaagat 
gcagagactg 
agttgcagta 
gtgaaaatgg 
gaagtagact 
ttcctcaatg 
tcatcaggag 
ccttctgcct 
gatatgctag 
atggtgtgtc 
agaaaggggg 
gatattgaag 
gacgttcaaa 
acacgtttgg 
gattcagttc 
ctgcatgaca 
tcatggtatt 
gactgggcat 
atttttgtac 
aagagtttcc 
ttgtgggaac 
ttctctgctt 
ctcaataccg 
ctactccatg 
ctgctcaggg 
aagagttctc 
cgctaccgac 
gaagatgaat 
cctaaagtta 
ggatctgctc 
ctgcgtgagg 
taagacagaa 
ttgtgtaaga 
ttttttccag 
atattatttc 
gatctggaaa 
taataatttc 
taagaatgtt 
ggttttggtt 
tatttgttgg 
tgtgagtagt 
tgtgaaatgt 
agggtcttcg 
tgcctttgct 
caaaaaaaaa 



tcccaacgta 
cagttgcttt 
ggacacagca 
gtgttgtttg 
aagaggcttc 
gtggtaatag 
attttcagag 
ttaaaaaact 
cttgtgtggg 
gagacattgc 
ttgatgttgg 
caatattgct 
ctgaactgac 
attttgcttg 
atttgccccc 
aagaattaga 
acagtcgact 
tacaagctac 
gcctgcatga 
ctcagagctt 
ttatactctc 
tggcacatat 
ggggagaaac 
agagtttttg 
tggttgccat 
atgatgatgt 
tgcacttcct 
agtggctgga 
ggcggcgaaa 
agatccggcc 
gaaaaaaaaa 
gtgtggtgct 
ggggaagtgt 
agacagagaa 
ctttttttcc 
cattgtttaa 
catgggcact 
agtagaacct 
cgttttgtac 
gttcctcttg 
ttagagtgaa 
attgtgtctt 
gaagaatacc 
ttttttcctg 
gctcacttgg 
gattcttcct 
taggcagatt 
aaa 



ggaccaggag 
ttctgttgat 
ctggacttgc 
tgatcatccc 
ttcaataata 
ccaaaggaga 
gattgaattg 
aaagcaaatt 
ggttgtagaa 
acgtcagctc 
ctatactctt 
tacagaggtg 
agaacaaatc 
ctattttctg 
aacagtccaa 
agaagaatct 
gtatgcactt 
cfcggggcatc 
ctgttcacat 
tggtttacat 
tcttgctagt 
tcttagacga 
tttaggatat 
ttggaaaagt 
ggaaaatgat 
caccatcaca 
ttctgctcag 
ctgctgtgtg 
tcaccccctg 
gtgtacatcc 
atcaaacagc 
ccaagcagag 
tttcctggac 
ctttagttgg 
ttccaaattg 
taggaaaagt 
tagaaaaagc 
ggcattctcc 
cgattatcca 
tgcccaatca 
ctaaatacag 
ttaagttttc 
ttaaaatgag 
gatgaaaagg 
actccatcaa 
tctatcacct 
gggaatacca 



tcctacagaa 
ccttgtgagg 
tctgtttgca 
agacctaata 
aatgagcaag 
tcacctcctg 
gctggtgctg 
aaaaacagga 
ggtgatttag 
accgcagatg 
gtacacttgg 
tctcaacaag 
cggagagaga 
actgaccttg 
gaaaaattat 
ccaattatta 
tggaaccgga 
tatgacaagg 
tggttttaca 
ttttccttga 
cagcctggag 
ccaattatag 
actcggtttc 
ccgattgctc 
ggctatggca 
tttttgcctc 
gagctaggta 
acggaggggg 
gtcactcaga 
ctgtctgatg 
agaagaccaa 
tcgacatcat 
cacacacacc 
actacagttt 
taaatctgtc 
tgtaccagca 
acatggcaaa 
tttcacctta 
cagcaaaaca 
aatcttttag 
acattgctta 
tgatatgccc 
ggttcttatt 
gagcaagccc 
caatgtgctg 
ctgctctaag 
attcactaca 



tctcctcagc 
aatacaatga 
catatgaaaa 
acattgaagc 
acagagctcg 
ctacgaagcg 
tgggaagcaa 
tgaaaaagac 
ctgccataga 
aagtacgctt 
ctatacgttt 
cagcaaagtg 
tagctgcctc 
taacatttac 
ttgatgaggt 
actggtcctt 
ctgcaggaga 
actcagtgct 
cacgctggaa 
gagaagaaca 
caagcttgga 
tttatggagt 
aaggtgttta 
tgggttatac 
accgaggtgc 
tggttgacag 
atgaggaaca 
gagttctggt 
tggtagaaaa 
gagaggaaga 
ggcatcagat 
ggaatgaacc 
ttatggagat 
gtaaaaaaaa 
tataaatgta 
tcttcatatt 
tggctctttg 
aaagatccat 
aaaataagct 
gaacaaactg 
cttgttttga 
cctttcaata 
ccagattctg 
acttgtcact 
ctcccagatt 
caaatcttgt 
gaataaagat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
300 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2843 



<210> 54 

<211> 1272 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc feature 
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<223> Incyte Clone No, : 3340296 
<400> 54 

tctacggcca cgactctggg agtggggaaa 
ctcggggttc cttctcacaa ctctgcgaag 
caactccagc tcccagcagg aggtggctgc 
gtctctctgt ctctctctca ccctcttcct 
ctctctccct cgtccactcc cccaaacatg 
cttcaagagg tggagatgtt ggaatgtgac 
gtgacttcca acgagagcac cgaggagagc 
ggccgcggcg gcctgggcaa gaggaggaag 
gtcagccagg aggggaagca ggtccagcgc 
atgcgagtgc tgagcaaggc cttctccaga 
gacaccaagc tctccaagct ggacacgctc 
aggcagatcc tggctaacga caaatacgag 
tggcccttta tggtggccgg gaaacccgag 
cgcttatgtg gaaccaccgc gtcctgacct 
ccggggggag cgggccccgg gaaggcgacc 
cccctcgcaa tgctcctctc tctgtcccac 
attcgtttcc aaaccagagg agatcaattg 
tattaacttc taccgtgaat gactctgcaa 
ataaatatat aaatagataa gagcctatca 
gtagatcata ggatagctga ctttgacagt 
tatatttttt tcaaacaagt tttgctactt 
aaaaaaaaaa aa 



cagagagccg gttcctctgc tgcagaagtc 6 0 
gggaaagggt tgtgagaccc aaccagaccc 12 0 
gccacactcg ggaggcctct tggtttcagg 18 0 
cgctttctct gtctctctgt ctctctctct 240 
tccaccggct ccctcagcga tgtggaggac 3 00 
gggttgaaaa tggattcgaa caaggaattt 36 0 
tccaactgcg agaatgggtc tccccagaag 420 
gcgcccacca agaagagccc cctgagcggg 4 80 
aacgccgcca acgcgcgaga gcgggcccgc 54 0 
ctcaagacca ccctgccctg ggtgcccccc 600 
aggctggcgt ccagctacat cgcccacttg 6 60 
aacgggtaca ttcacccggt caacctgacg 720 
agtgacctga aagaagtggt gaccgcgagc 78 0 
tggaggtgcg agtctgggaa aggcgcgctc 84 0 
cctgccctca gtgctctctg tctctgcttc 900 
cccgcgagaa cactttacaa cgacgaggag 96 0 
tacttacaaa gattcccatc tatttaactt 1020 
gccttgctgg tccaagtgca atatgtaatt 1080 
atgtatcttt tgtacaatat gttgtaaaat 1140 
cacatttata aagtaattca cttaaagata 1200 
ttgaaaataa atctttcttt atattgctaa 1260 

1272 



<210> 55 

<211> 1117 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 3536740 

<400> 55 

tgcactgcca cgccgagctg aggctgagct 
gctacaagac tttcatgatc gacgagatcc 
aactttccct ctactccgtg tgcccgtcgc 
gtacgggctc cccttccctg cgggcatatc 
ctgtcatctc ccacctggtc cctgccaccc 
aggtcaccga ggcggtctct gctgaggccc 
cagagacgga acagcccacg ccccgacaga 
ccgagctgca gctcatgggc ctggagaaga 
cagacaggtt ggacttggct cagtctctgg 
atcagaatcg caggatgaaa tggaagaaaa 
caaaacccaa aggtcgcccc aagaagaact 
aagagaagat gaacagccag gcccagggtc 
aggagctctg tgaagcacag gaaccgaaag 
caccagaccc gccccaggag ttgccaatac 
gtaaaaccet tttgagggaa gagggagact 
gagtagggag agaaaacctt ccagcagccc 
ctccctccct cccacagtta ccattggcct 
ttgtcttggg cctgtcacct cctgaaaggc 
gagagagcgc ctaggagctg cctgccccag 



cgcccggcca gctcaaagca gccaggcggc 6 0 
tctccaagga gacctgcgat tactttgaga 120 
tggtcgtgcg acccaagccc ctgcattcct 180 
cgctcctctc ggtgatcacc cgccagccca 240 
cgggaatcgc ccaggcactg tcctgccacc 3 00 
cagggggcga ggccctagcc agcagcgagt 360 
agaagccccg ccggagtcgc accatcttca 420 
aattccagaa gcagaagtat ttgtcaaccc 480 
gactcactca gctgcaggtg aagacctggt 540 
tggttcttaa aggtggacag gaagcaccca 6 00 
ccatccccac atcagaagag attgaagctg 660 
aggagcagct ggagccctct caggggcagg 72 0 
cacgtgatgt ccccttagag atggcagagc 780 
cctcttcgga acccccacca ttaagctaaa 840 
ggggagaagg gaaaagagag aaggcaggga 90 0 
agtaaactgc gggcgaagag atctacccgt 960 
tgtcatcgca agcatttgac aaagacttgc 102 0 
tgctttagct gtggatgccc ttgattaagg 1080 
ctggggt 1117 
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<210> 56 

<211> 3033 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc__f eature 

<223> Incyte Clone No. : 082155 



<400> 56 

gcagatgcta 

tgttcagctt 

tcctgtgggg 

acaccggctc 

ctgaagcact 

gggagcagtg 

atctgcctgg 
gcctgcatca 
cctgtgtgtc 
atagttgaga 
tgtgagcacc 
tgggtttgtg 
gtcaaggaat 
gaggctgaga 
cagatcgaga 
gaggagcaga 
gcagcagcta 
ctccagcgga 
aaaaggagtg 
gtattccgag 
cagtactact 
tctgtggatc 
tgtgattttt 
tgggaagtag 
agtctgaata 
gtttactcca 
gaatataatg 
gctgtgcctc 
ttcaatgtca 
cctgcttatc 
agctcctgag 
cagattctgc 
ttacttagaa 
ctgatttaaa 
agcagagttg 
ggcttgtaga 
gcccaagttt 
gccttaatga 
aatatttgga 
cactcataac 
ttggtttcac 
atccttccta 
ctgcccctta 
aagtacataa 
acaccccata 
tggattttta 
tgatcttaaa 



gaacataatg 
cctttgtttt 
aaactcattt 
ctgccctgcc 
gcaggagttt 
caatggattt 
agctcctgac 
ctgcaaagat 
agaccagatt 
gagtcaaaga 
atggaaaaaa 
aactgtctca 
gtcaggaaaa 
agctggaaga 
gacagaagat 
gagagctgca 
cagaccagct 
ggttgacggg 
aaagctggac 
taccagatct 

gggtggacgt 

agagacaagt 
ctgcttttgg 
atgtgtctgg 
aaaggaagag 
gatatagacc 
cttttgagga 
cctgtcgtat 
caaaccacgg 
cgtatttcaa 
tgttctcatt 
acctgagttc 
tgtctttgta 
ctgtaattgt 
gttctttctg 
aggcatcttg 
tcccttttta 
aatcatttta 
ggtattggga 
aatcttaccc 
tagtagtaaa 
tagaaagaac 
aaagattgaa 
gaatctatca 
ttcatcacaa 
atgctcagag 
taatttactt 



tagcattact 
ctttcacttc 
gtggagacca 
ttcactcttc 
gtgaccaaga 
ctcagtaaag 
agaacctctg 
caaggagtca 
ccagcctggg 
ggtcaagatg 
actccagatc 
ggaacaccaa 
gctgcaggta 
tgacatcaga 
tctgaaaggg 
aaagctggag 
ggtccagcag 
atcgtcagta 
attgaagaag 
gagtgggatg 
gatgctgaat 
gaaaactgta 
tgtcttcggc 
aaagattgcc 
ctctgggttt 
tcaatatggc 
ctcctcctct 
tggggttttc 
agcactcatc 
tccttggaac 
cctttaccca 
atctctactg 
ttcatttgct 
attgccgtac 
aggtctgcaa 
tccctatgac 
ttaagacacc 
acattatatt 
ataaaaactc 
tttcttgcaa 
cattattatt 
agtattgctg 
gaaagagaaa 
ctaagtaatg 
aattaaagca 
tttctgaggt 
aatgtatttt 



ttccccaggg 
tgagaattta 
gccctctggc 
tcccctgatt 
acttcaagag 
gtagacatag 
agcctagatt 
gtgatcatct 
aacctccgac 
agcccacagg 
ttctgtaagg 
ggtcaccaaa 
gccctgcaga 
caagagagaa 
ttcaatgaaa 
gaaggtgagg 
aggcaggatg 
gagatgctgc 
ccaaaatctg 
ctgcaagttc 
ccaggcagtg 
cgcacctgca 
tgccaatatt 
tggatcctgg 
gcttttgatc 
tactgggtta 
tctgatccca 
ctagactatg 
tacaagttct 
tgcctagtcc 
cttctgcata 
agaccatctc 
agggcttcca 
tgtgggctgg 
ggaagggctc 
tcttcacatt 
agtcatactg 
ctctacaaag 
caacatataa 
gagatgcttg 
ttttttatat 
taattecttt 
cttgtcaact 
tatccttcag 
agaagtccat 
caaattttat 
ggtgtatttt 



tttattgtta 
actttcgttt 
ttggtgagtg 
caagactcct 
tcaagacaga 
agaaggaggt 
gtggccacag 
caagagggga 
ctaatcggca 
aggggcagaa 
aggatggaaa 
cattccgcat 
ggctgataaa 
ccgcctggaa 
tgagagtcat 
tgaatgtgct 
ccagcacgct 
aggatgtgat 
tttccaagaa 
ttaaagagct 
ccacttcgaa 
catttaagaa 
tctcttcggg 
gcgtacacag 
caagtgtaaa 
taggattaca 
aggttttgac 
aggcaggcat 
ctggatgtcg 
ccatgactgt 
gtagcccttg 
ttcctttctt 
tagcaaagca 
aaatcccaaa 
tgttccatgc 
gtctttatgt 
gctcagggcc 
accttatttc 
atttgaggaa 
tacattattt 
ttgcaaagga 
tcttttcttc. 
catatccacg 
aatgtgttgg 
agtaatttat 
cttttcactt 
cctcaaatta 



tgtaagttct 
ctcactcagc 
aatctggttt 
ctgctttgga 
aggaagccaa 
gacctgcccc 
cttctgccaa 
aagcagctgt 
tctggccaac 
gagagatgtc 
agtcatttgc 
aaacgaggtg 

ggaggatcaa 

gaattatatc 
cttggacaat 
ggacaacctg 
catctcagat 
tgacgtcatg 
actaaagagt 
gacagatgtc 
tgttgctatt 
ttcaaatcca 
gaaatattac 
taaaataagt 
ttattcaaaa 
gaatacatgt 
tctctttatg 
tgtctcattt 
cttttctcga 
gtgcccaccg 
tgctgagact 
tccccttctt 
tcatagattg 
tctagattcc 
ctctctcctt 
acatctctgt 
caccgctaat 
caaataagat 
ggcacgattt 
tcctaatacc 
aacatatcta 
ctcatttcct 
ttatctagca 
tttaccagtg 
ttgctaatag 
acaagctcta 
atattggtgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
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tcaagactat atctaattcc tctgatcact ttgagaaaca aacttttatt aaatgtaagg 2880 

cacttttcta tgaattttaa atataaaaat aaatattgtt ctgattatta ctgaaaagat 2940 

gtcagccatt tcaatgtctt gggaaacaat tttttgtttt tgttctgttt tctttttgct 3000 

tcaataaaac aatagctggc tctaaaaaaa aaa 3033 



<210> 57 
<211> 6138 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte Clone No.: 095477 

<400> 57 

ggctgaggac tgactggggt tctgagactc 
tgaacaaagt ctgctcaaat ctcctgctgt 
cacatataaa aatctattgt aaaaatacgg 
aactttgggc ctgaatggct ccgagctctg 
ctttctccag cattgccgaa gtataaatta 
ttagcacttt tccttaaaga caacaagata 
cctatcctcc aggaggaacc ccttccacca 
cagagaaact tttccatgtc tgtaaatagt 
ggaggaggaa cagtggtggg ggctcctaga 
ggcagaggtg aatgtggttt ctaccaaaga 
cgaggaggtg gcagagaaat gcatagatcg 
tttgaaaaac caggacgaaa agatgtaggg 
tcagtaggga gaaagcatga atttatacgc 
gaggaacaaa atggagaaga tgaagatgga 
ggagagaggt ggcgacctca cagtcctgat 
atggaacgac gtcggaggtt tgagtttgat 
cgaagggttc gctctggcag tgggagcata 
tgcttagagg atgctgaaga agaaatgggt 
ctaaaaaaag tacagaaaga gcctattcca 
gacgaagggg aggagtgctc tgactctgag 
gataagacaa ataagaaaga aggagagaaa 
gaaactcccc agacctcatc atcatctgct 
caggaagcat cacagtttga gaggaaagat 
gaagaggaga ctcggatgga aaatagtcta 
atggttgctg atgtccagca gcccctgtcg 
ctcatacttc cacctcctgt tcccaatcct 
gttgtaggtg ctcctggtat gggcagtgtt 
aaacatttgg agcagcaagc tgagaaaatg 
gatgaaagat tggcatcaaa actgcaagag 
atgcatgaag caatgcagaa gtggtattac 
ttcaataatc aggagatggc agaatggttt 
gtgaagagag cgtgtgatga aagcttccaa 
agggttccct tttctccagg tccagctccc 
cgactgacca ggcagcaaga actcacagcc 
cagtttttaa tacaacaaca atatgcacag 
tcttcccagc agcagcagca gttggcactt 
agaatatctg atcagaacat cattccctca 
ggctctatct gggagcttca gccaacagct 
gtatgggatc ttcctctgga caccacgaca 
ctagagaagg ccaaagctgc aaagctagag 
aaacgggaag aggaagagcg aaagaggcag 



cctgtcccgg accgcagcgt taaaaggatc 6 0 
gaaccagcag aatttttgaa caggtttctt 120 
aaaagaatgg cagcggaaac gcagacactg 180 
tccagtggtg ggagtattac atcccctcct 240 
gcagattatc gttacggcag agaagaaatg 3 00 
ccttcagacc ttctggataa agaatttctg 36 0 
ttggctctgg taccctttac agaagaagaa 42 0 
gctgctgtcc tgcgattgac aggacgagga 480 
ggtcgaagtt cttcaagagg gcgaggcaga 54 0 
agttttgatg aagtagaggg tgtttttggt 6 00 
cagagctggg aggaaagggg tgacagacgt 6 60 
agaccaaatt ttgaggaagg tggaccaaca 72 0 
tcagaaagtg aaaattggcg catctttaga 78 0 
ggttggcgac tagctggatc aaggagggat 84 0 
ggccctcgtt ctgcaggctg gcgggaacac 900 
tttcgagata gagatgatga acggggttac 96 0 
gatgatgaca gggatagctt gcccgaatgg 102 0 
acatttgact catctggagc attcctttct 1080 
gaagagcagg agatggactt ccggcctgtg 114 0 
ggtagccata atgaagaggc caaagaaccc 12 0 0 
acagatagag taggagttga agctagtgag 12 6 0 
agaccaggta ctccttcaga ccatcagtct 1320 
gaaccaaaaa ctgagcaaac ggaaaaagct 13 8 0 
ccagccaaag tgcccagcag aggggatgaa 144 0 
cagattcctt cagatacagc ctctcctctt 1500 
agtcctactc tccggccagt tgaaacacca 156 0 
tccacagaac ctgatgatga agaaggtctc 16 2 0 
gtggcttatc tccaagacag tgcactagat 16 8 0 
cacagagcta aaggagtgtc gattccattg 174 0 
aaagatcctc agggagaaat tcaaggtccc 18 0 0 
caggcgggct attttactat gtctttattg 186 0 
cctcttggcg atatcatgaa aatgtgggga 1920 
cctcctcata tgggagagct ggaccaggaa 1980 
ttataccaga tgcagcacct gcagtaccag 2040 
gttttggccc aacagcagaa agcagcactg 2100 
cttcttcaac agtttcagac cttgaagatg 216 0 
gtaactaggt ctgtgtccgt gccagatact 222 0 
tcacagccta cagtttggga aggtggtagt 2280 
ccaggccctg ccctggaaca gcttcagcag 2 34 0 
caagagagaa gagaggcaga aatgagggca 24 00 
gaagaactcc gaagacaaca ggaggaaatt 246 0 
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cttcggcgac 
aaacaggaag 
gaagaggaag 
gaagaggaag 
aaatgggccc 
gaagaggagg 
gtccagcggc 
ttgcagcagc 

tggggccagc 

atccaaaaac 
agggagttga 
tgggggaatg 
gaggccaggc 
gctcgtaaca 
ataaatactg 
gctgacacta 
aggaattcaa 
aaccggcaga 
aataaagccc 
gcaaataact 
gaggtccatg 
aagcagttcc 
ctgccacagc 
gactctgtgt 
aacaaccaac 
atggtccgag 
aacatgggtg 
cctctcctgt 
actctttatc 
ctttgtgtcc 
tggactggac 
taggcagcat 
gatcggccat 
tcttcttttt 
tggttaacag 
aggcagtcag 
tcagagaggc 
cacctgttct 
tgacttctga 
ctatggccfcg 
ataaatattt 
ttgtcfccgtt 
ttttggatag 
agcattgcgt 
gcaagttgtt 
catccatgga 
gagcaggaag 
aaatgtactc 
gaacatcacc 
cgagaggtgc 
cctgaagaag 
ctatttttat 
acatgtgtaa 
gcatgagggg 
tcaaaagttg 
cggaactttt 
ctatctgagt 



agcaggaaga 
aggctctgcg 
aaagacagca 
aaaggcggaa 
gggaagaaga 
cagccagact 
agaaggagtt 
agcagcagca 
agtccaatac 
cagaggaaga 
tgaaagctct 
ccagcaaacc 
aaatgcaaaa 
atacgcattc 
gtcctcctaa 
aaaactccaa 
caaataaaaa 
ataagaaagt 
aagatggatt 
tggatgttcc 
attatatcag 
ttgagcgccg 
agcagcagca 

gggggatgaa 

aatccaattt 
cagatcccag 
aaatcgagac 
ctgccgacta 
actctgcaac 
aagattcttt 
cctgtgtctt 
gtgttcactg 
tagcagcttg 
ccccccatca 
acattggcca 
agctctcccg 
cctgagtggg 
ttctgagtgt 
gcaagctgat 
ggagagagac 
ttacttcaga 
taagaggcag 
ggataaactt 
gtgtgtcttg 
tgccttttct 
atctactata 
cctggcctgg 
aagccctatt 
tggaattgcc 
tgtggtctgg 
gctggagttg 
tattttggat 
taactataca 
agaggaaacc 
ggggtgggta 
gccagtgtgg 
gcacctcttg 



agaaaggaaa 
tcgccagcgg 
gcaagaagaa 
gcaggaagaa 
agaagcccag 
ccggcatgag 
aatgcgccag 
acaacagctg 
aacagcatgt 
acgagaacgg 
tcagcagcag 
ttcaggtacc 
gcagcagcag 
caacctgcac 
ccagtgggca 
catgggattc 
taaaaacaac 
agaagaagaa 
tacgcagtgg 
cacatttgtt 
ggcctattta 
tgccaaacag 
gcagccgcca 
ccacagtaca 
tgaggctgtg 
tttattagga 
gttggatgac 
tggagtctcc 
aaatcacaga 
aatccatttt 

gggggtggca 

tgctgtgatt 
ctttctcttg 
gggcaaatgg 
ataacaaaac 
gtctgaaagt 
gttgagatga 
cctttctctg 
ctcccctggc 
tattcttaat 
ggactaggac 
ctagaatctt 
ttgcttctga 
ctgtagcccg 
gaaaagagaa 
caggaggatg 
cttctggtca 
tataaacaaa 
actcacactg 
gcagcccctg 
ctctgagagc 
caccattctc 
gagactgcta 
attaaaaatt 
agagggatag 
aggaaaataa 
tactcacctt 



aggcgagagg 
gagcaagaaa 
gctcttagaa 
ttgttacgca 
cgtcgattag 
gaagaagaac 
aggcagcagc 
gcgcagatga 
cagtcccagg 
cagcttcgag 
cagcaacagc 
acgaaatctc 
cagcagcagc 
accagcattg 
tctgacctag 

tgggatgcttg 

gccagtctca 
gaaaagttgc 
tgtgaacaga 
tctttcctga 
ggagatactt 
aaagccaacc 
cagcagccgc 
ctccattcag 
cagagtggca 
ttttcagtca 
tactgagcac 
acctttggac 
accgatcatc 
tgttggtgaa 
gtfcgggatta 
tcatctactg 
tcactttttt 
tctaactggt 
accccatgga 
tgcattgcca 
ggttggtttg 
aaaggattta 
atgctccaac 
ttttctttct 
cattttgttt 
taccatatgt 
taaaagcctg 
gaaaaggttt 
catacagaac 
cagtgggctg 
tggcctccta 
tacttttcct 
ggttggagtc 
gaaaagcacc 
agtttgggtt 
eetatccctt 
caaaattgta 
ggggctccta 
ttaaaatgtt 
aaaagaactt 
tatggaggct 



aagaagaact 
ttgcattaag 
gactggaaga 
aacaggaaga 
aggagaaccg 
ggaagagaaa 
agcaagaggc 
agcttccttc 
ccacgctgtc 
aagagcaaag 
aacagcagaa 
ttctggagat 
aacaccagca 
ggaattctgt 
tcagtagtat 
cagtgaaaga 
gtaaatctgt 
tgaagctctt 
tgcttcatgc 
aagaagtaga 
ctgaggccaa 
agcagcgtca 
cacagcagcc 
tatttcagac 
agaagaagaa 
atgcatcatc 
ctgccagtgg 
acaacactta 
tcaggctttt 
catctcagac 
ctccccaaca 
tctcccagaa 
tcttctattt 
gcaatcatga 
ctgtgactcg 
ctgctaactt 
gtttgatgtt 
tgtttttctt 
ctgattggac 
tacaaaaact 
tgggcccttc 
atgaatttgt 
gaatttcatc 
tgtgtaaaga 
ctgtccatct 
gaggggatgg 
aaaccttaaa 
gcctccacca 
attgggcagc 
tttgctgcct 
tggagtatta 
cttgcctccc 
tatagttttt 
ctctcctttg 
tacaaaactt 
aaataaaatc 
gagttctgca 



tgcccgaagg 
gcgacagcga 
gaggagaaga 
ggaggctgca 
gctgcggatg 
ggagctggag 
tctccggagg 
ttcttcaacg 
gttggctgaa 
gcgccagcag 
actctcaggt 
ccagcaggaa 
accaaacaga 
trtggggctct 
ttggagtaat 

ggtgggacct 

aggtgtgtct 
tcagggagta 
ccttaatacg 
atctccttat 

ggagtttgcc 

gcagcagcag 
acaacagcag 
caatcaaagc 
aaagcagaag 
ggagcgactc 
actggccatc 
ctcaccattt 
tcttctggcc 
tatagataag 
aggctgattt 
agtgtgttgg 
tgttttttct 
agagagttaa 
agtatccaac 
tgggattgca 
acacactcct 
cgttagatag 
aaaggaagct 
gatttttccc 
tgctgaaaat 
ataatttcat 
tggtcctcag 
ttctgggatg 
ttaagacctt 
gcgaaaatgg 
cttcaagtag 
aacccctaca 
tgtgcctgtg 
gtcattgttg 
tatttggctt 
tcccttctaa 
ggatcaaata 
ctttgtaaat 
taggctccct 
tgattgtatt 
ctaaactgtt 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
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cctcttggta ccatggaaaa gctccaagca cccaagacat ggaggcagcc atggcttctt 594 0 

tctctgccag accacgtagc actggctggt tctgtatttg agaatgtaga ggtcaaggca 600 0 

gatgtcggaa tgctgatacc tgattctagt tatggatatg gaggcaagga gaagttgtta 6060 

ggttaaccag cgcagtcctc cgtgcgtccc gaccgccgct gccctcactc ccggccaaga 612 0 

tggcatcctg tctggcac 613 8 



<210> 58 

<211> 3190 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte Clone No. : 1399169 

<400> 58 

cttcggggcc aagatggcga cgggaacggg 
agagccatgg tccatccgag agaagctatg 
tcaaaattgg gtatcagtta gcagagcaat 
agactggttc tctcaaaaac attgtgcttc 
gacaccaaaa cggaaacgag gtgaaaaggg 
tgttcggaaa ttgactgctg agcgagttga 
ggagagatat agacggctaa agagagatgc 
cagactggat gagctttgca atgacattgc 
tgaagtaaag aggaaggcta cagatgctgc 
cccccggagg ttacccactg tgatggttcg 
tgattatcca cttggggact tgactccaac 
ccccgggact ttgccgagta ccccagtcac 
tccaggctct gcacccttag aagcccccat 
aaagatgctt: ggacagaaag caactccacc 
gaagggcagc ctcctgccta ctagccccag 
ttctggccac ctgaacagta caggtgtcct 
acatggtggg gagatacagc aaacacccaa 
tgctcccact ctttcccggc ttttagaagc 
ttccttcact actgttgcca gtgagcctcc 
tgtgtcccaa gctaccattg tcatgatgcc 
tgtctccact actgaaagtg tagctccagt 
ggctgtgggg gatccacata ctgtgactgt 
catcaattct atcaaagaag agtgttttcg 
aaaggctccc agcatagatg ggaaggaaga 
tgtgtcttac acaggtgaag agctggattt 
tgaggacaag gtagatgatc atcctgaagt 
gtcattttgt gaagaaaatg atgatcctca 
ccagcaggag cgggacaagc cagtacctct 
gagactggac tttgaggaaa cggaaaacaa 
gcccagtgca gagatcaagg tggaacctgc 
aatagtagct ggagttgttc cagccacaag 
cttagatgag gaactgggaa gtactgcagc 
tgggaaaggc gatgagactc cacttacaaa 
gttgtctcca tcacatggct caaatcccat 
caagtttgaa atgtcagact cattgaaaga 
aaaggatgcc ccaggtgagg atgaggagga 
ggagcctaag gaagaggatc aaggagaagg 
tgtgagcgag agtgatgatg gcttcagcat 
ggcagactcc atccccagca gccctgcttc 
ggaagctatt caggcacaga aaatttggaa 



caaacacaag ctgctaagca ctggccccac 6 0 
tttagcatct tctgtcatga gaagtggcga 12 0 
caagcccttt gcagaacctg gccgccctcc 180 
ccagtactcg gagcttttag agaccactga 240 
agaagtggtg gaaactgttg aagatgttat 3 00 
agaactaaag aaagtgataa aggaaaccca 360 
agaactaatt caagctggac acatggacag 42 0 
aacgaaaaag aaattggaag aagaggaggc 480 
ataccaggct cgtcaagcag taaaaacacc 54 0 
ctctcctata gattctgcct ccccaggagg 600 
cactatggaa gaggctacct ctggggtaac 66 0 
ctcgtttcct gggattcct-g acacccttcc 720 
gaccccagta acagatgatt caccccagaa 780 
cccctcccct ctgctgtcag agctcttgaa 840 
actggtcaat gagagtgaaa tggctgtggc 900 
cctggaggta ggcggggtcc ttcccatgat 96 0 
tactgttgca gcctcccctg ctgcatcagg 1020 
tggtcctaca cagttcacca cacctcttgc 1080 
agttaaactt gtgccacccc ctgtagagtc 114 0 
tgcgctgcca gcaccatcct ctgctccggc 1200 
gagtcaaccc gacaactgtg ttcccatgga 1260 
ttccatggac agcagtgaaa tatccatgat 13 2 0 
at caggggta gcagaggct c ctgttggatc 13 8 0 
attagatctg gctgagaaga tggatattgc 144 0 
tgagactgtt ggagacatca ttgccatcat 1500 
gctggatgtg gcagcagtgg aagcagcact 156 0 
gtccctgcct ggcccctggg agcatcctat 162 0 
ccctgcacca gaaatgacgg tcaagcaaga 1680 
gggaatacat gaactggtgg acatcaggga 174 0 
agaaccagag ccagtcattt caggagccga 18 00 
tatggagcca ccagaactca ggagtcagga 186 0 
tggagagatt gttgaagcag atgttgccat 192 0 
tgtgaagaca gaggcat ccc ctgaaagcat 198 0 
tgaagatcct ttagaggcag agactcagca 2 04 0 
agaatcaggg actatttttg gaagccagat 2100 
agatggtgtc agtgaagcgg ccagcctaga 216 0 
ctacttgtca gaaatggata atgaacctcc 2220 
acacaatgct acactgcagt cacacacact 22 80 
ttcacagttc tctgtctgta gtgaggatca 2340 
gaaagccatc atgcttgtat ggagagctgc 24 00 
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agctaatcat aggtatgcca atgtcttcct 
ctaccacagc attgtgcaga ggcctatgga 
tggactgatc cgaagcacag ctgaatttca 
tgtaatgtac aatagctcag accatgatgt 
tgtcttggaa cagatccagc aattcttggc 
tgggatcagt gctaaaagtc ttcgagggag 
gaaggacagt gtcccaatgg gctctcctgc 
caggggacgc cgctgtgcca ttgaagcaga 
taccctcttt gagccgaacc taaaataaaa 
ccagttccag aggtggaagt tacagaagag 
aaaatctctc ttagagagtt ggagtagcac 
ctatgttaat gtcctaatgt gtagctagca 
ttgtacagac tttttaaaaa aaacaaccac 
aaaaaaaaaa 



gcagcctgtt acagatgaca tagcacctgg 246 0 
tttgtcaact attaagaaaa acatagaaaa 2 52 0 
gcgtgacatt atgctgatgt ttcagaatgc 2 58 0 
ctatcacatg gcagtggaga tgcagcgaga 264 0 
cacgcagttg attatgcaaa catccgagtc 2 700 
agattctacc cgcaaacagg atgcttcaga 2760 
cttccttctc tctctctttg atggaggaac 2820 
tatgaagatg aaaaagtgaa gcctcagagt 2880 
gtaaacaaga tagagcttgg gcttgcgggc 2 94 0 
gaggtacctg ggccacacga catgagctgg 3000 
aattgcctgt tttagggcag aaaccatggg 3 06 0 
gatcgtagct agtttgtatt gtcttgtcaa 3120 
cagtgaaatg tgtgtgtata caataaactg 318 0 

3190 



<210> 59 
<211> 1391 
<212> DNA 

<213> Homo sapiens 



<220> 

<221:> unsure 
<222> 1203, 1204 

<223> a or g or c or t, unknown, or other 



<220> 

<221> misc_feature 

<223> Incyte Clone No.: 1442069 



<400> 59 

tttttttttt ccagaggaga gtacaggtcg 
gctcttggag cagtcaggca gtgactgcct 
tatagagagc tacaacaatg cccaaaagaa 
agccaaagag aagatctgcc aggttgtctg 
agcctaaaag aacatcaagt tcaaggaaaa 
acatagatac aagtgcccaa gcagttgctg 
actacaatga aaatgctaaa aatggagaag 
aagaaattgt ggaagtaaaa gaagaaaata 
agaaagaagc agtggcagca gaagtaaaaa 
aagatcaaaa cgaagagaaa ggggaagctg 
aagatggaaa agaggataaa aatggaaatg 
atggaaaaaa aggtgaagac ggaaaaggaa 
atgaaaaaga ggaagaagac agaaaagaaa 
aagagaaggg agataaaaaa gaggggaaag 
gagaagatgg aaaagaagat gaaggtggaa 
atttaaaaga agaggaagaa ggaaaagagg 
aagaggagcc acagagtatt gtttaaaact 
gtaccttcat gttgtaaagt taatagagat 
agcctttctt tagcattgat ttaatttcag 
gtttctaa'ca tgaaacattt atctataaat 
aannatatgc tttcaacttt gtgagtggat 
ggttttaatt tactctttta tacatgtgat 
tcatccacat tcctgtctgc aggtgcttta 
aactatcttc c 



tgctgcagtt agttcattga aaactcattt 60 
tcggcttttt ttctgctgac taagatctcc 120 
aggctgcagg tcaaggtgat atgaggcagg 18 0 
ctatgcttgt gccagttaca ccagaggtga 240 
tgaagacaaa aagtgatatg atggaagaaa 3 00 
aaaccaagca agaagcagtt gttgaagaag 36 0 
ccaaaattac agaggcacca gcttctgaaa 42 0 
ttgaagatgc cacagaaaag ggaggagaaa 48 0 
atgaagaaga agatcagaaa gaagatgaag 54 0 
gaaaagaaga caaagatgaa aaaggggaag 6 00 
agaaaggaga agatgcaaaa gagaaagaag 66 0 
atggagaaga tggaaaagag aaaggagaag 72 0 
caggagatgg aaaagagaat gaagatggaa 7 80 
atgtaaaagt caaagaagat gaaaaagaga 84 0 
atgaggaaga agctggaaaa gagaaagaag 900 
aagatgagat caaagaagat gatggaaaaa 96 0 
gccctatgta gtttcataat ttggtaacat 1020 
aaatattttt atcaaaaatt ttataaacac 108 0 
aacatcttca tattgattat tagccataaa 1140 
tttgtgatta tagtagtgga atacatagaa 12 0 0 
ttcgtgatgt gtagttatat gtcaatcttt 126 0 
agttcataag tgagggatcc aaaacaaggt 132 0 
taagaggtga ctatttcagt aatgtagggt 1380 

1391 
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<210> 60 

<211> 1125 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte Clone No. : 1596668 



<40Q> 60 

caacaaagca aggaagacgg agtccgagcc tcgggggctc ctagcaacgg gccggggcgg 60 
gagttccatg gagactgggg agcgcgcccg tctcatcctc atccttgtcc tccagcttct 120 
ccttcgcatc cgacgcaacc ggcagcagcg ctgccgccgc gtcctcagcc accgctccct 18 0 
cttcccacgg atgtgatctt cgtggtggaa agctaaattt taaaaccacc ccaatggatg 240 
cagacagtga tgttgcattg gacattctaa ttacaaatgt agtctgtgtt tttagaacaa 3 00 
gatgtcattt aaacttaagg aagattgctt tggaaggagc aaatgtaatt tataaacgtg 360 
atgttggaaa agtattaatg aagcttagaa aacctagaat tacagctaca atttggtcct 420 
caggaaaaat tatttgcact ggagcaacaa gtgaagaaga agctaaattt ggtgccagac 4 80 
gcttagcccg tagtctgcag aaactaggtt ttcaggtaat atttacagat tttaaggttg 540 
ttaacgttct ggcagtgtgt aacatgccat ttgaaatccg tttgccagaa ttcacaaaga 6 00 
acaatagacc tcatgccagt tacgaacctg aacttcatcc tgctgtgtgc tatcggataa 660 
aatctctaag agctacatta cagatttttt caacaggaag tatcacagta acagggccca 72 0 
atgtaaaggc tgttgctact gctgtggaac agatttaccc atttgtgttt gaaagcagga 78 0 
aagaaatttt ataattcacc acttaattgg ttagaatctc taactgagca ccttttaaac 840 
ctgctgcaca ttggactcaa aaggaaaact ggaccaacaa taattgagga aatagactct 900 
tttattcatt cacggctaca gtgtaagctc cagtcccttt ggattttatt ccaaaccttg 960 
ctgtaatata aaaggaagtt tacaagacat gatattgctg cttttacaaa aggacattct 1020 
atttattttc gcagtaattc tcatgtcccc ataagcagag ctgtcacagt gtgcactacc 1080 
ttagattgtt ttattgtcgt cattgttatt tttttccatt ttgag 1125 



<210> 61 

<211> 3073 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte Clone No. : 1977214 



<400> 61 

cggaattcgg ctcgagggcg actggagcgg ttccctcgca ggcggcgcca ttttgtgcta 6 0 
ggagcctgat aaaaccggcc cggttctgtg gaaagtgggc ggcggagcca gggtccctgg 120 
aatggcggag actctgtcag gcctaggtga ttctggagcg gcgggcgcgg cggctctgag 18 0 
ctccgcctcg tcagagaccg ggacgcggcg cctcagcgac ctgcgagtga tcgatctgcg 240 
ggcggagctg aggaaacgga atgtggactc gagcggcaac aagagcgttt tgatggagcg 3 00 
gctgaagaag gcaattgaag atgaaggtgg taatcctgac gaaattgaaa ttacctccga 36 0 
gggaaacaag aaaacatcaa agaggtctag caaagggcgc aaaccagaag aagagggtgt 42 0 
ggaagataac gggctggagg aaaactctgg ggatggacag gaggatgttg agaccagtct 4 80 
ggagaacttg caggacatcg acatcatgga tatcagtgtg ttggatgaag cagaaattga 540 
taatggaagc gttgcagatt gtgtcgaaga cgatgatgct gataacctcc aggagtccct 600 
gtcggatagt agagagctag tcgaggggga aatgaaagag cttccggagc agcttcagga 6 60 
acatgctata gaggacaaag aaactataaa caatttagat acttcatcat ctgacttcac 72 0 
tatattacag gaaattgaag agccatccct ggagccagaa aatgagaaaa tactcgacat 78 0 
tttgggggaa acttgtaaat ctgagccagt aaaagaagaa agttccgagc tggagcagcc 84 0 
atttgcacag gacacaagta gcgtggggcc agacagaaag cttgcggagg aagaggacct 900 
atttgacagc gcccatccgg aagagggtga tttagatttg gccagcgagt caacagcaca 96 0 
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cgctcagtcg 
gccaggcgat 
gagtagtgcg 
ggaagcccca 
cgcttgtaat 
gaaaatgagt 
tcgcagcagt 
agatttgaag 
tgcccggagt 
cacaaaatgc 
gaaagccaaa 
aaaggagaag 
cagaaaagat 
tgatcagaaa 
cgaacggact 
cgggtccaaa 
gtccgtcgtg 
tagcagggtg 
ggagcgtgag 
gcgggagcgc 
caggcgcgag 
gcgctatgag 
tgatgcctat 
taaccgccag 
ctcggtggac 
cccagaacgc 
ctatggctct 
acgtgactgg 

cgggggcatg 

tcactccggg 
cccaggcggg 
aggccagagc 
tggaatcctg 
ctgtgtaaaa 
tgtttttcgt 
taaaaaaaaa 



agcaaggcag 
ggcgagagga 
gcctccgagc 
agcccagaag 
gaagtccctc 
tctcccgaag 
tgtggtagaa 
aatcttttca 
cctggagctc 
attaaccacc 
aatgaacctg 
tcgagcaaca 
gatgctaaga 
cctggcccct 
gtagtaatgg 
gagagagctt 
tcctttgata 
cgtgaacgca 
cggctggaga 
atggagcggg 
caggagcgca 
caggagcggc 
tggccggaag 
gaccgcttcc 
aggagagaag 
catggaggac 
gacaagagga 
ggggaccatg 
atggacaggg 
cctggccaca 
gcctcccggg 
cgggggagca 
tgtcctgtct 
atattttttt 
tttggggttt 
aaa 



acagcctgtt 
cggactgtga 
tcgcggaggc 
ccagagatag 
cggctcctaa 
atgactcgga 
atttctgggt 
gcaaatatgg 
gctgttacgg 
tgcacaagac 
tgggaaagaa 
gtgacagatc 
agggtgacga 
cagagcgatc 
ataaatccaa 
ccaaaagcca 
aggtcaagga 
gtgaacgcga 
ttgcccgaga 
aacggctgga 
tccaccgtga 
ggcccgcggt 
ccaagcgggc 
acgactttga 
gttcaaggtc 
cagagcgcca 
tgagcgaggg 
gccgaagaga 
atcacaagag 
tgatgaaccg 
gccaccccat 
ggcccagcga 
cgtggcaaca 
ttaatctgct 
tttttttttg 



agcggtagtg 
gcctgtaggg 
ctctagcgag 
caaagaagac 
agagtcctca 
tacaaaaagg 
tagtggactc 
gaaggtggtg 
ttttgtcacg 
ggagctccac 
aacctctgac 
tacaaacctt 
cggaagtgga 
tcgagccaca 
aggggtgcct 
ggatcgcaaa 
gcctcggaag 
acaacgcatg 
gaggctggcc 
gcgcgaacgc 
gcgcgaggag 
gcggcggccc 
cgccctggat 
ccacagggac 
aatgatggga 
cggcggggcc 
ccgggggctg 
ggatgaccgg 
gtggcaaggt 
aggaggaatg 
cccacaeggt 
tgcccgcttc 
aggctatgtt 
gccatattgt 
taataaatgt 



aaaagggagc 
ctagagccgg 
gagctcgcag 
gggaggaagt 
accagtgagg 
ctttccaaag 
tcttctacaa 
ggcgccaagg 
atgtccacag 
ggaaagatga 
aaaagagaca 
aagagggatg 
gaaaagagta 
aagtcaggaa 
gtgattagtg 
tcagccagca 
tcaagagact 
caggcgcagt 
ttccagcgcc 
atgcacgtgg 
ctgaggcgcc 
tacgacctgg 
gagcgctacc 
cgcggccgct 
gaacgagaag 
tcccgcgatg 
cctcctcccc 
tcatggcagg 
ggcgagagaa 
tcagggcgcg 
ggcatgcagg 
actcgccgct 
ctgttaggag 
agctcaatac 
gtttccgttc 



ccgcggagca 
cagttgagca 
aagcacccac 
ttgattttga 
gcgctgatca 
aggaaaaggg 
ccagagctac 
ttgtgacaaa 
cagaagaggc 
tctccgtgga 
gtgacgggaa 
ataaatgtga 
aggaccaaga 
gtcgagggac 
taaaaacgtc 
gagagaagcg 
cagagtccca 
gggagcgcga 
agcggctgga 
agcacgacgg 
agcaggaact 
accggcgaga 
attctgactt 
accccgacca 
gacagcatta 
gctggggggg 
ccaggggcag 
gcacggccga 
gcatgtccgg 
gcagctttzgc 
gcgggtttgg 
actgagtact 
ttaccttaaa 
aatgtgaatt 
acataccctt 



1020 
1O80 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3073 



<210> 62 
<211> 1446 
<2X2> DNA 

<213 > Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte Clone No. : 2181282 

<400> 62 

agagtaaaaa atgatcccca agaattcaga gaccaaaagc tggggaccct gaaaaaatac 6 0 
cgtagcatta tgcccaaacc tatcatggtc atacccactt tggcctccct ggcttctcca 120 
actacactac agtcccagat gcttgggggc ctaggacagg atgttttgtt aaataattca 18 0 
ctcactccta aatatcttgg ctgtaagcaa gacaacagct cttcccctaa gcccagctcc 24 0 
gtgttcagaa atggattctc tggcattaag aagccttggc acagatgtca cgtctgcaac 3 00 
caccacttcc agttcaaaca gcaccttcga gaccacatga atacacacac caacagacgc 36 0 
ccttacagtt gtcggatttg tcgcaagtcc tatgtacgtc ctggcagcct gagcacacac 42 0 
atgaaacttc atcatggtga gaaccgtctg aagaaactca tgtgttgtga gttttgtgca 4 80 
aaagtgtttg gccacatccg agtctatttt ggccatctga aagaagtgca tagggttgtg 54 0 
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atcagcactg agcctgcgcc cagtgaactg 
atgagtgtgc gaggcatgga gggatcattg 
gacttccttc taaaccaggc agacgaagtc 
attactgctc agtcttttgc ggaaataaaa 
attgagggca ggctacaaga agggaccttc 
gtgcagcacg ctagccccga ctggaaaagg 
cattcctcct ccgaggaatc acatgcatgt 
cagaatggcg tggaaatgct catggaaaat 
agggaaacct gccagggccc tgagtgtcct 
tcaggcttta actgcctgct ttgtgcagag 
cactggaagc accagcataa ctgtgaggac 
gtctccaacc agggagtgat cgaactttcc 
gcctggggtt aaggagagag ctctgccgcc 
ggtgcttagt cacaaagatc aaacaacagg 
tacatggttt tattttctta acgaaataaa 
aaaaaa 



cagccaggag acataccaaa gaacagagac 6 00 
gagagggaaa acaagtccaa cctggaagaa 660 
aaattacaaa tcaaatgtgg tcgttgtcag 72 0 
tttcatttac ttgatgttca tggagaggaa 78 0 
ccaggaagca aggggactca ggaagagttg 84 0 
catcctgaga gagggaagcc ggagaaggtt 900 
ccaagactga aaaggcagct ccaccttcat 960 
gaaggacccc agtcaggaac caacaagcca 102 0 
ggcctccaca cgtttctctt gtggtcccat 1080 
atgctgggac ggaaagagga cctcctccac 114 0 
ccttccaaac tgtgggcfcat tttaaatacg 12 00 
agtgaagctg agaaatgaga ccccaaggca 1260 
accttccttc agagcttcgt gctttatggt 13 2 0 
attggtgtga gtgaacggaa atgatttttg 13 80 
atataagctc tcgaagcata tttttctaaa 144 0 

1446 
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